IJCSIS Vol. 14 No. 6, June 2016 Part 2 
ISSN 1947-5500 


International Journal of 
Computer Science 
& Information Security 







© IJCSIS PUBLICATION 2016 
Pennsylvania, USA 


Indexed and technically co-sponsored by : 



THOMSON 

REUTERS 



ProOuest 



V scientific netmtk 


ReseatvhGATE 


I Member 

Connecting Research Organization 
and Researchers 




SlideShare 




(j Academia.edu 

^ share research 



arXiv.org 


open access 


to the world's 
science 


m 

* **.- >•(* A 




CiteSeer.lST 

Scientific Literature Digital Library 


AUTHOR SERIES 

scirus 

www.scirus.com 



dblp 

computer science bibliography 


v>v 


CiteFactor 

Academic Sdcriiflc Journals 



KNOWLEDGE 

Transforming Research 









IJCSIS 

ISSN (online): 1947-5500 

Please consider to contribute to and/or forward to the appropriate groups the following opportunity to submit and publish 
original scientific results. 

CALL FOR PAPERS 

International Journal of Computer Science and Information Security (IJCSIS) 
January-December 2016 Issues 

The topics suggested by this issue can be discussed in term of concepts, surveys, state of the art, research, 
standards, implementations, running experiments, applications, and industrial case studies. Authors are invited 
to submit complete unpublished papers, which are not under review in any other conference or journal in the 
following, but not limited to, topic areas. 

See authors guide for manuscript preparation and submission guidelines. 

Indexed by Google Scholar, DBLP, CiteSeerX, Directory for Open Access Journal (DOAJ), Bielefeld 
Academic Search Engine (BASE), SCIRUS, Scopus Database, Cornell University Library, ScientificCommons, 
ProQuest, EBSCO and more. 

Deadline: see web site 
Notification: see web site 
Revision: see web site 
Publication: see web site 


Context-aware systems 

Networking technologies 

Security in network, systems, and applications 

Evolutionary computation 

Industrial systems 

Evolutionary computation 

Autonomic and autonomous systems 

Bio-technologies 

Knowledge data systems 

Mobile and distance education 

Intelligent techniques, logics and systems 

Knowledge processing 

Information technologies 

Internet and web technologies 

Digital information processing 

Cognitive science and knowledge 


Agent-based systems 
Mobility and multimedia systems 
Systems performance 
Networking and telecommunications 
Software development and deployment 
Knowledge virtualization 
Systems and networks on the chip 
Knowledge for global defense 
Information Systems [IS] 

IPv6 Today - Technology and deployment 
Modeling 

Software Engineering 

Optimization 

Complexity 

Natural Language Processing 
Speech Synthesis 
Data Mining 


For more topics, please see web site https://sites.google.com/site/ijcsis/ 

arXiv.org Google scholar SCirus ^L n ^— OScribd ffeSSES 


BASE 

EMefeid |i.o5d«fnic Search Cnonne 


CiteSeerl™ 


Cu .uni-trier, de 

H I 

,Q I Computer Science 
T3 I Bibliography 


DOAJ 


DIRECTORY OF 
OPEN ACCESS 
JOURNALS 



Pro 



For more information, please visit the journal website (https://sites.google.com/site/ijcsis/) 


Editorial 

Message from Editorial Board 


It is our great pleasure to present the June 2016 issue (Volume 14 Number 6 Part 1, 2 & 3) of 
the International Journal of Computer Science and Information Security (IJCSIS). High 
quality research, survey & review articles are proposed from experts in the field, promoting insight 
and understanding of the state of the art, and trends in computer science and technology. It 
especially provides a platform for high-caliber academics, practitioners and PhD/Doctoral 
graduates to publish completed work and latest research outcomes. According to Google Scholar, 
up to now papers published in IJCSIS have been cited over 6390 times and the number is quickly 
increasing. This statistics shows that IJCSIS has established the first step to be an international 
and prestigious journal in the field of Computer Science and Information Security. There have 
been many improvements to the processing of papers; we have also witnessed a significant 
growth in interest through a higher number of submissions as well as through the breadth and 
quality of those submissions. IJCSIS is indexed in major academic/scientific databases and 
important repositories, such as: Google Scholar, Thomson Reuters, ArXiv, CiteSeerX, Cornell’s 
University Library, Ei Compendex, ISI Scopus, DBLP, DOAJ, ProQuest, ResearchGate, 
Academia.edu and EBSCO among others. 

On behalf of IJCSIS community and the sponsors, we congratulate the authors and thank the 
reviewers for their outstanding efforts to review and recommend high quality papers for 
publication. In particular, we would like to thank the international academia and researchers for 
continued support by citing papers published in IJCSIS. Without their sustained and unselfish 
commitments, IJCSIS would not have achieved its current premier status. 

“We support researchers to succeed by providing high visibility & impact value, prestige and 
excellence in research publication.” For further questions or other suggestions please do not 
hesitate to contact us at iicsiseditorOamail. com . 

A complete list of journals can be found at: 

http://sites.qooale.com/site/iicsis/ 
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1. PaperlD 31051608: ESSPI: Exponential Smoothing Seasonal Planting Index, a New Algorithm for Prediction 
Rainfall (pp. 1-9) 

Kristoko D. Hartomo, Faculty of Information Technology, Satya Wacana Christian University, Salatiga, Indonesia 
Subanar, Faculty of Mathematics and Natural Sciences, GadjahMada University, Yogyakarta, Indonesia 
Edi Winarko, Faculty of Mathematics and Natural Sciences, GadjahMada University, Yogyakarta, Indonesia 

Abstract — Exponential smoothing algorithm is a prediction algorithm recommended by the Food and Agriculture 
Organization. The weakness of exponential smoothing prediction algorithm is low accuracy for the prediction of long- 
term and ineffective in determining the value of smoothing to minimize error. The proposed research is to build a 
model rainfall prediction using a new algorithm Seasonal Planting Index (ESSPI). By using the algorithm planting 
seasonal index, rainfall prediction model will generate higher accuracy. The results showed seasonal planting method 
is the dominant index (5 of 6 test size) have an average accuracy is better than the method of exponential smoothing. 
Index planting seasonal prediction accuracy of 95.73% better than the exponential smoothing a = 0.1 by 56.55%, and 
exponential smoothing of a = 55.53. Novelty of this research is new algorithms for classifying data based on seasonal 
planting index, a new algorithm for determining the smoothing (value), the new fitting algorithm using seasonal 
planting index, and new algorithms using seasonal rainfall prediction planting index for the determination of the 
growing season. 

Keywords — exponential; smoothing; algorithm; seasonal planting index; predictions; accuracy; rainfall; novelty 


2. PaperlD 31051609: A New MultiPathTCP Flooding Attacks Mitigation Technique (pp. 10-15) 

Adwan Yasin, Department of Computer Science, Arab American University, Jenin, Palestine 
Hamzah Hijawi, Department of Computer Science, Arab American University, Jenin, Palestine 

Abstract — MPTCP is a new protocol proposed by IETF working group as an extension for standard TCP, it adds the 
capability to split the TCP connection across multiple paths. It provides higher availability and improves the 
throughput between two multi-address endpoints. Many Linux distributions have been developed to support MPTCP, 
most of them are open source which can be modified and compiled to support different experimental scenarios. 
Splitting the single path TCP connection across multiple paths adds new challenges in paths management and raises 
new security threats. Some of these threats include flooding and hijacking attacks performed by on-path and offpath 
attackers. In this article, we propose a new algorithm to mitigate the flooding and hijacking attacks in MPTCP, the 
proposed method allows a stateful processing of the initial SYN message and it’s following SYN JOIN messages. 

Keywords — TCP, MPTCP, flooding, hijack, on-path, off-path, flooding, DoS 


3. PaperlD 31051613: Temporal Performances Evaluation of Multi-Robot Demining System Inspired by Ant 
Behavior (pp. 16-24) 

Riadh SAAIDIA, Mohamed Sahbi BELLAMINE, Abdessattar BEN AMOR 

Computer Laboratory for Industrial Systems (LISI), National Institute of Applied Sciences and Technology 
(University of Carthage), INSAT, TUNISIA 

Abstract — In this paper we adopt a cooperative strategy based on ACO (Ant Colony Optimization) algorithms to 
coordinate a Multi Robots System (MRS). Our principal objective is to evaluate temporal performances for this system 
by choosing demining operations as a benchmark problem. In this work, we try to adapt the ACO algorithm parameters 
for different mine distribution in order to reduce time demining operations. In particular, we report effects of 
evaporation pheromone rate model and minefield configuration on temporal performances. 


Index Terms — ACO algorithms, multi-robot system (MRS), evaporation pheromone rate, demining system. 


4. PaperlD 31051614: Towards Developing a Cost Effective Solution for Environmental Monitoring (pp. 25- 
28) 

Muhammad Soban Khan, Ans Ali Raza, Zeeshan Musawar, Shoaib Hassan, Taimoor Hassan 

Department of Computer Science, COMSATS Institute of Information Technology, Sahiwal, COMSATS Road off GT 

road, Sahiwal 57000, Pakistan 

Abstract - Environment refers to everything that surrounds a person. Environment contains many types of pollution. 
Most dangerous pollution is air pollution. Most important factor that causes human health is air pollution. Many 
countries are suffering from air pollution. There are many factors that cause air pollution. Some major factors are 
smoke, carbon monoxide and high temperature. Many developing countries are creating solutions for detecting and 
analyzing the air pollution. The main idea of our research is based on proposing a cost effective solution for 
environmental detection. Our system is a connection between sensors, Raspberry Pi, Microsoft Azure and Android 
Mobiles. Raspberry Pi gets environmental values with help of Raspberry Pi and sends the data to Microsoft Azure 
through API, form where Android Mobile gets those values with the help of HTTP request. Our proposed system 
successfully detected temperature, humidity, hydrogen, methane, propane, carbon monoxide and air level. The results 
show that our system is most cost effective, secure and easy to use. It will helpful in saving lives. 

Keywords: Environment Pollution, Environmental monitoring system, Raspberry Pi, Air pollution 


5. PaperlD 31051615: AV Encryption Algorithm to Protect Audio visual Content for IPTV (pp. 29-39) 

Muhammad Akram, C. A. Rahim, Amjad Hussain Zahid 

The Institute of Management Sciences (PAK-AIMS), 54660 Lahore, Pakistan 

Abstract — Crypt analytical techniques for multimedia technologies particularly audio visual applications have shown 
some existing flaws while maintaining the security and computational time. This case study is a representative 
algorithm especially for protection of IPTV contents. The network's reliability and security of contents is the major 
issue in IPTV media business. The proposed algorithm is the Audio Video MPEG file encryption technique in which 
the synchronization between audio and video and the frame sequence is shuffled before the transmitting end or vertical 
device. . The shuffling process is guided by input key frames to point out frame positions. The MPEG video frames 
are first extracted via spatial pyramid kernel. It divides the stream into regions over different scales and to find out the 
frame similarity while on merging of AV frames. Then ciphers are implemented to locate the shuffled frames and 
further genetic algorithm such as AES is used to encrypt. By this way, AV contents of IPTV can be secure from 
malicious users. 

Keywords— MPEG, IPTV, CAS, DRM, DES, AES 


6. PaperlD 31051616: Secure Speaker Biometric System using GFCC with Additive White Gaussian Noise and 
Wavelet Filter (pp. 40-47) 

Gaganpreet Kaur, Deptt. of CSE, I.K Punjab Technical University, Punjab, India 

Dr. Dheerendra Singh, Deptt. of CSE, Chandigarh College of Engineering and Technology, Sector-26, Chandigarh, 
India 

Abstract — Speaker Identification (SI) aims to identify the speaker’s identity from the given list of speakers. Speaker 
identification is efficient under the clean training and testing environment conditions. In real environment application, 
there occurs mismatch between training and testing environments due to background noise, which degrades the 
system’s performance and security. So, robust speaker identification is the important issue in research. This paper 


describes the recently used front end algorithm based on Gammatone Frequency Cepstral Coefficients (GFCC) along 
with speech detection algorithm and Cepstral mean normalization (CMN). System makes model using Gaussian 
Mixture Model (GMM) Classifier, which uses iterative Expectation Maximization (EM) Algorithm to estimate the 
Gaussian model parameters. Training data is taken in clean environment and all test utterances are corrupted by adding 
White Gaussian Noise (AWGN). This paper aims to improve the robustness of speaker identification even when 
additive noise is added during testing phase. For improvement Wavelet Filter is implemented to de-noise the speech 
signal. Experiment is carried out in real database oriented and stored database oriented relative to the Attendance 
System application. Experiment is carried on 100 speakers saying phrases like ‘Yes mam’ “present mam”, ‘Yes sir’, 
‘present sir’ with 4 types of utterances for each phrase (so database includes 400 utterances). Experiment results 
obtained shows better performance in noisy environment. The results for stored database oriented experiment show 
that the algorithm gives 85% of Correct Recognition Rate (CORR) while using wavelet filter and 73% without using 
the filter. The results for real database oriented experiment shows 74% of identification rate while using wavelet filter 
and 45% without using the filter. 

Keywords — Gammatone Frequency Cepstral Coefficients (GFCC); Gaussian Mixture Model (GMM); Cepstral mean 
normalization (CMN); Robust Speaker Identification, Additive White Gaussian Noise (AWGN); Wavelet Filter. 


7. PaperlD 31051620: A Novel Algorithm for Load Balancing using HBA and ACO in Cloud Computing 
Environment (pp. 48-52) 

Seyed Majid Mousavi, University of Debrecen, Faculty of Informatics, Debrecen, Hungary 
Fazekas Gabor, University of Debrecen, Faculty of Informatics, Debrecen, Hungary 

Abstract — Cloud computing is an emerging technology and new trend for computing based on virtualization of 
resources. Scheduling of tasks to reach load balancing is a challenge in cloud environment. Load balancing is the 
process of distribution of the load among VMs in order to efficiently utilize of resources and avoiding the situation 
where some VMs are overloaded or idle. Load balancing of non-preemptive tasks is one of the critical issues in task 
scheduling in clouds environment. To improve throughput at cloud resources, an intelligent and dynamic load 
balancing can significantly increase cloud’s performance and minimize the costs. Although, many algorithms, 
strategies and methods have been proposed, but load balancing is still one of the challenging issues in resource 
allocation in cloud computing environment. In this paper we propose a novel load balancing strategy using Honey 
Bees and Ant Colony behavior algorithms in cloud environment. The proposed algorithm strives to balance the load 
of the virtual machines, trying to minimize the completion time of given tasks and reduce response time in cloud 
infrastructure. 

Keywords: load balancing, ant colony, honey bee, cloud computing. 


8. PaperlD 31051621: Route Optimization in MANET Using Hopfield Neural Networks: MANET-HOP (pp. 
53-59) 

Sanjeev Gangway Department of Computer Application, V. B. S. Purvanchal University, Jaunpur, India 
Dr. Krishan Kumar, Department of Computer Science, Gurukul Kangri University, Haridwar, India 

Abstract — As we know that Mobile Ad Hoc Network is the combination of nodes having unstable setup which usually 
formed instantly in independent manner. It does not have any centralized administration. Moreover they don’t have 
any permanent setup and routers. In such situations routing becomes the responsibility of individual nodes and also 
routing is equally important to realize the practical benefits of MANET. Traditional protocols of MANET: DSR, 
AODV, DSDV, OLTP work well but still need improvements time-to-time as per the new issues like QoS provisioning 
and routing. Above protocols mainly depends on hop count measurement. In this paper we have implemented a 
specific problem of six nodes situated at different locations with primary goal to find the shortest route visiting each 
node at least once which is based on the concept of Travelling Salesman Problem using Feedback/Hopfield Neural 
Network. And we found that Hopfield networks are suitable to find the shortest route. 


Keywords- Mobile ad-hoc network, Hopfield neural network, Travelling salesman problem, Route optimization 


9. PaperlD 31051629: A Modified Black hole-Based Task Scheduling Technique for Cloud Computing 
Environment (pp. 60-67) 

Fatemeh Ebadifard, Department of computer, Iran University of science and technology, Tehran, Iran 
Zeinab Borhanifard, Department of computer, Qom University, Qom, Iran 

Ahmad Akbari, Department of computer, Iran University of science and technology, Tehran, Iran 

Abstract — The issue of scheduling is one of the most important ones to be considered by providers of the cloud 
computing in the data center. Using a suitable solution lets the providers of cloud computing use the available 
resources more. Additionally, the satisfaction of clients is met through provision of service quality parameters. Most 
of the solutions for this problem aim at one of the service quality factors and in order to achieve this goal, variety of 
methods are used. Using the algorithm of modified black hole in this paper, a proper solution is presented to tackle 
the problem of scheduling the affairs in cloud environment. The proposed method reduces makespan, increases degree 
of load balancing, and improves the resource's utilization by considering the capability of each virtual machine. We 
have compared the proposed algorithm with existing task scheduling algorithms. Simulation results indicate that the 
proposed algorithm makes a good improvement regarding the makespan and amount of resource utilization compared 
to schedulers based on Random assignment and particle swarm optimization Algorithms. 

Keywords- cloud computing; task scheduling; Black hole; makespan; resource utilization. 


10. PaperlD 31051631: A Multicast Routing Protocol Based on ODMRP with Stable link in Mobile Ad Hoc 
Networks (pp. 68-75) 

Ebrahim Asadi, Department of Computer Engineering, Shabestar Branch, Islamic Azad University, Shabestar, Iran 
Ali Ghaffari, Department of Computer Engineering, Shabestar Branch, Islamic Azad University, Shabestar, Iran 

Abstract — Mobile ad hoc networks are more flexible than tradition networks since they do not require fixed 
infrastructure and allow all nodes move in a random trajectory, which leads frequent rerouting and degrades network 
performance. So, an important issue in mobile computer network research is routing in mobile ad hoc networks. 
Multicast sending is one of the methods used for routing in mobile ad hoc networks because of its group activities. 
However, some problems exist in multicast sending. For example, when receiver nodes attempt to send 
acknowledgments or path repetition packets simultaneously, crashes may occur, which leads to packet loss. On the 
other hand, link expiration is another reason for packet loss. In this study, a multicast routing protocol is offered, 
which uses a combination of two parameters of the received signal’s power and the remaining energy to estimate the 
stability of the link. SINR is used at each node in conjunction with various transmitters to determine a reliable path 
that reduces link failure and end-to-end delay. The aim is to find the best link with probability of the highest life cycle 
for each path. Simulation results of the proposed method using NS-2 simulator indicate the good performance of IMP- 
ODMRP measures in packet delivery rate, end-to-end delay, packet loss rate, and packet collision rate. 
Keywords-Mobile ad hoc networks; multicast; routing; IMP-ODMRP protocol; Standard ODMRP; Stable Link. 


11. PaperlD 31051639: A Survey on Human Social Phenomena inspired Algorithms (pp. 76-81) 

Thanh Tung Khuat, My Hanh Le 

DATIC Laboratory, IT Faculty, University of Science and Technology - The University of Danang, Vietnam 

Abstract — The problem of seeking the optimal solution in the field of science and engineering has been becoming 
complex and challenging due to the explosion of dimensions and the interdependence of variables. Over the past few 
decades, a variety of new concepts, techniques and computational applications inspired from nature have been 
proposed and used to deal with a wide range of optimization problems in diverse fields. Many of nature-inspired 
algorithms generate high-quality solutions for real-world optimization tasks. Nevertheless, the majority of these 


methods are inspired by either biological phenomena or social behaviors of mainly animals and insects. There are few 
works relied on social phenomena of human being used to form optimization algorithms. This paper aims at presenting 
an adequate review of most predominant and successful groups of optimization approaches based on human social 
phenomena. 

Index Terms — Human Social Phenomena, Society Civilization Algorithm, Cultural Algorithms. Teaching-learning- 
based Optimization, Social Learning Algorithm, Alliance Formation based Algorithms, Social Emotional 
Optimization Algorithm, Social Labeling. 


12. PaperlD 31051641: Mammogram Classification Using Selected GLCM Features and Random Forest 
Classifier (pp. 82-87) 

Vibhav Prakash Singh, Ayush Srivastava, Devang Kulshreshtha, Arpit Chaudhary, Rajeev Srivastava 
Department of Computer Science & Engineering, Indian Institute of Technology (BHU), Varanasi, Uttar Pradesh- 
221005, India 

Abstract - Early diagnosis of breast cancer can improve the survival rate by detecting the cancer at initial stage. 
Mammogram is a low dose X-ray image of the breast region, used to diagnose the breast cancer at early stage. In this 
paper, an efficient computer added diagnosis (CAD) system is proposed, automatically detects the normal and 
abnormal images of mammogram. The proposed pre-processing steps include, cropping of mammograms (for 
avoiding the pectoral muscle, unwanted tags) and suppression of Gaussian noise. Further, gray level co-occurrence 
matrix (GLCM) based statistical texture feature from different distances of neighboring and angles are extracted. 
Furthermore, most relevant features are also examined using AdaBoost feature selection method. Finally, normal and 
abnormal mammograms are classified using Random forest (RF) classifier. Experiments on benchmark 
mammography image analysis society (MIAS) database confirm the effectiveness of this work. 

Keywords-CAD; Mammography; GLCM features; Feature selection; Random forest classifier. 


13. PaperlD 31051643: Enhancement of Intrusion-Detection System in MANETs with the Digital Signature via 
Elliptic Curve Cryptosystem (pp. 88-94) 

K. Spurthi, T. N. Shankar, S. Sabari Giri Murugan 
Computer Science & Engineering, KL University, AP, India 

Abstract- The watchdog scheme is popular in MANET to defend the malicious attacks, but the major pitfall of this 
method is unable to detect some destructive actions. The technique Enhanced adaptive acknowledgment EAACK is 
designed to handle some weaknesses as false misbehavior, limited transmission power, and receiver collision of the 
watchdog scheme that is not fully efficient to resolve all the problems. This paper focuses intrusion detection system 
on MANETs with the collaboration of three IDS approach and with the techniques ACK, 2-ACK, and misbehavior 
report identification MRI. This paper proposes digital signature with Elliptic Curve Cryptosystem to avoid forging 
acknowledgment packets from attackers. 

Keywords: DSR, MANET, AOMDV, watchdog, ACK, 2-ACK, MRI. 


14. PaperlD 31051644: P-Method: Improving AODV Routing Protocol for Against Network Layer Attacks in 
Mobile Ad-Hoc Networks (pp. 95-103) 

Shahram Zandiyan, Department of Computer Engineering, Ardabil branch, Islamic Azad University, Ardabil, Iran 
Reza Fotohi, Department of Computer Engineering, Germi branch, Islamic Azad University, Germi, Iran 
Marzieh Koravand, Department of Computer Engineering, Germi branch, Islamic Azad University, Germi, Iran 


Abstract — Mobile ad hoc networks are regarded as a group of networks consisted of wireless systems which 
developing together a network with self-arrangement capability, no constant communication infrastructure and use 
central nodes to communicate with other nodes. Despite lots of advantages, these networks face severe security 
challenges, since their channels are wireless and each node is connected to central node. One of these concerns is the 
incidence of network layer attacks (Black and worm hole attack) is one kind of routing disturbing attacks and can 
bring great damage to the network. In this attack, an attacker cheats nodes, absorbs their packets and then deletes 
them. Hence, black hole and wormhole disrupts communication, or even makes it impossible in some cases. In this 
paper, we proposed P-Method for against network layer attacks in mobile Ad-Hoc networks based on hop count and 
RTT test. The proposed algorithm is implemented in ns2.35 environments and is compared with AODV And DSR 
under attacks, and improved AODV in different scenarios. Simulation results revealed that the (P-method), is better 
than AODV And DSR under attack in terms of packet dropped, packet loss, throughput, and jitter. 

Keywords- Mobile ad hoc networks, AODV and DSR routing protocol, Black hole attack, Worm hole, P-Method. 


15. PaperlD 31051653: Check the Use of Raise in Wireless Sensor Networks Based on Heuristic Algorithms 
Along with Soft Computing Approach (pp. 104-119) 

Abolfazl Akbari, Department of Computer Engineering, Ayatollah Amoli Branch, Islamic Azad University, Amol, 

Iran 

Pourya Khodabandeh, Marlik Higher Education Institute, Nowshahr, Iran 

Ali Khosrozadeh, Department of Computer Engineering, Ayatollah Amoli Branch, Islamic Azad University, Amol, 
Iran 

Abstract - The use of Wireless Sensor Networks (WSNs) has grown dramatically in recent decades, and the use of 
these networks in the areas of military, health, environment, business, etc. increases every day. A wireless sensor 
network consists of many tiny sensor nodes with wireless communications and work independently. In applications 
of such sensor nodes, hundreds or even thousands of low-cost sensor nodes are dispersed over the monitoring area, in 
which each sensor node periodically reports its sensed data to the base station (sink). Due to limitations in the 
communication range, sensor nodes transmit their sensed data through multiple hops. Each sensor node acts as a 
routing element for other nodes for transmitting data. One of the most important challenges in designing such networks 
is the management of energy consumption of nodes; because replacing or charging the batteries of these nodes are 
usually impossible. One of the main characteristics of these networks is that the network lifetime is highly related to 
the route selection. Unbalanced energy consumption is an inherent problem in WSNs characterized by the multi-hop 
routing and many-to-one traffic pattern. This uneven energy dissipation in many routing algorithms can cause network 
partition because some nodes that are part of the efficient path are drained from their battery energy quicker. To 
efficiently route data through transmission path from node to node and to prolong the overall lifetime of the network, 
In this thesis we proposed three new routing algorithms using a combination of both Fuzzy approach and A-star 
algorithm seeks to investigate the problems of balancing energy consumption and maximization of network lifetime 
for WSNs : A-Star with 3 parameters fuzzy system (A*3F), A-Star with 3 fuzzy system with 2 parameters using 
majority vote (A*3FMV) and A-Star with 3 fuzzy system with 2 parameters using simple additive weighting 
(A*3FSAW). The new methods is capable of selecting optimal routing path from the source node to the sink by 
favoring the highest remaining energy, minimum number of hops, lowest traffic load and energy consumption rate. 
We evaluate and compare the efficiency of the proposed algorithms with each other methods under the same criteria 
in four different topographical areas. Simulation results show that A*3PFSAW and A*3PFMV balances the energy 
consumption well among all sensor nodes and achieves an obvious improvement on the network lifetime that randomly 
scattered nodes and flat routing. 

Keywords: Wireless Sensor Networks, A-Star algorithm, Fuzzy logic, Network lifetime, Multi-hop routing. 
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Abstract — To reduce network congestion and to guarantee a certain level of Quality of Service (QoS) for service 
requests, Call Admission Control (CAC) as a part of Radio Resource Management (RRM) aims to accept or reject a 
call based on available resources. In this paper, we proposed new CAC and resources allocation schemes for Long 
Term Evolution (LTE). The proposed CAC scheme gives the priority of Handoff Calls (HC), without totally neglecting 
the requirements of a New Calls (NC). The main objective of this approach is to provide QoS and to prevent network 
congestion. Simulation results show that the call admission control scheme leads to increased session establishment 
success and resource utilization compared with existing admission control and resources allocation schemes. 
Moreover, the resources allocation scheme achieves a considerable gain in the system throughput and fairness. 

Keywords — Call admission control; QoS; Scheduling; LTE; Uplink; Throughput. 
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Abstract — Facebook is becoming very popular as millions of users are sharing their thoughts by using various data 
formats. The motive behind its launch was to find old friends and relatives and make new friends. All Social Networks 
need to meet the increasing user demands of data storage and retrieval. The Social Networks are based on cloud to 
deal with dynamic speed of data generation. The success of Facebook has resulted in increased user traffic and large 
amount of data is continuously generated by its users’. It requires novel ways of storing data and removal and removal 
of duplicates as much as possible while maintaining the speed of responding to a query. In this paper, an attempt is 
made for the identification of data duplication and its removal. Social networking sites need dynamic data management 
by identifying duplicate data and its deletion technique. The removal of duplicate data is necessary, not only to reduce 
runtime, but also to improve search accuracy and efficiency. The implementation of this method reduces the indexing 
time to a great extent by decreasing the collection length, resulting in the reduction of the amount of hardware required 
to support the system. 

Keywords- Hashing; indexing; similarity checking; unique documents; detecting replicate; data duplicity; web 
mining; Facebook. 
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Abstract — The excessive or irrational use of drugs categorized as Proton Pump Inhibitor (PPI) was indicated in Baptis 
Hospital of Kediri, Indonesia. In the PPI-based drug regimen among patients with digestive disorders from December 
2009 to February 2010, many cases that the PPI-based drug regimen was not in accordance with the prevailing 
procedures were found, i.e. the drug regimen among patients who should not be given it. In this study, a method was 
developed to generate the PPI-based drug regimen rule. Data on the PPI-based drug regimen were trained using 
Learning Vector Quantization (LVQ) algorithm. The results of LVQ were stored as new data, which were extracted 
into IF-THEN rule with C4.5 algorithm. Based on the test, eighteen rules were generated for the PPI-based drug 
regimen with an accuracy rate of 82.5% on test data. 


Keywords— PPI-based drug regimen; rule generation; LVQ; C4.5 
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Abstract - Management Information Systems is the process of transforming the accumulated data into useful and 
helpful information systems. This paper work is on design and construction of Advanced Pathology Management 
System (APMS). The objectives of the APMS is to i) Well-secured login system ii) Simple and easy patient registration 
form iii) Better test processing system i.e scheduling for the test and tracking the reports iv) Efficient Report 
Management system i.e, creation, searching and verification of the required reports v) Well-defined privacy 
management systems. The developed APMS is tested over Urgent care hospital, New Delhi. The event logs of 
outpatients are accumulated from the hospital and preprocessed using process mining approaches. Performance indices 
such as wait time for consultation wait time for test and the aggregate time spent on the outpatient care are analyzed. 
Experimental results prove the efficiency of the developed Advanced Pathology Management System (APMS). 

Keywords: Management Information Systems, Clinical Pathology, Report Management, Outpatients and Process 
mining approaches. 
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Abstract - Sensor nodes covers surrounding area and report any events to a base station over multi-hop communication. 
The base station plays a key role in the network. The adversary, wants to disrupt network operation, would excitedly 
look for the base station and target it with attacks in order to inflict maximum damage. To avoid maximum damage a 
novel approach is proposed for boosting the anonymity of the base station. In the proposed research the numbers of 
base stations are increased from one to many (such as 2 to 5) in the network operation. The purpose is to divert the 
adversary attention about the base station and adversary considers the base station as a sensor node. Experimentation 
results suggest that the approach provide a backup facility in case if one of the base stations is failed due to adversary 
or due to energy failure. Therefore enhances network security. 

Keywords - Anonymity, Base Station, Backup Base Station, Wireless Sensor Network 
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Abstract - In the recent times, the demands of Wireless Sensor Networks (WSN) increase the challenges in terms of 
scalability and energy efficiency. One of the key challenges in the wireless sensor network is how to prolong the 
lifetime of the network. To improve the lifetime of the sensor, static and movable mobile sinks are deployed. Movable 
sinks are used to receive sensed data from the sensor where it is located. The static mobile sinks act as a trusted third 
party for computing and distributing keys between sensor nodes and the clusters. It is not necessary to chose new 


cluster head often because of trusted third party sink, performs all the computations of cluster head. The energy is 
retained when computation is reduced in cluster head thereby increases the life time of the particular cluster. Feed 
forward Back propagation algorithm is proposed using adaptive learning in neural networks followed by link aware 
routing. This algorithm deals with fault tolerant backbone tree construction for data transmission whereas it produces 
optimal path for the sink to transmit data. Since the optimal path is established, the life of the sink also to be prolonged 
thereby increase the overall network lifetime. Result shows that the lifetime of the network is improved and energy 
depletion is reduced. 

Keywords - Sensor Networks, mobile sink, clusters 
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Abstract — Software development effort estimation is the process of predicting the effort required to develop a 
software system. Estimating development effort accurately in the early stage of software life cycle plays a crucial role 
in effective project management. Effort estimation is a key factor for software project success, defined as delivering 
software of agreed quality and functionality within schedule and budget. Traditionally effort estimation has been used 
for planning and tracking project resources. It has become an important task. This paper proposed a neural network 
model for software effort estimation. This model has 3 layers. The train, validation and test data used are from 
COCOMO data set. Inputs and targets data randomly divided in train (60 %), validation (20%) and test (20%) group. 
When the number of neurons in hidden layer was 20, Number of training samples was 37, number of validation 
samples was 13 and number of testing samples was 13, the network has best performance. In this case, the value of 
training, validation and testing MSE was 0.01044, 0.0475 and 0.0375 respectively and value of training, validation 
and testing R was 0.9167, 0.7741 and 0.7410 respectively. 

Keywords- Software Engineering, Effort Estimation, Artificial Neural Network 
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Abstract — Forgery detection is the most important task in our national judicial system and criminal investigation 
procedure. Today digital images have become powerful source of communication. With the advancement of 
technology, it becomes very easy to change the content of digital images. Due to which these images are no more 
taken as a proof of authenticity or legitimacy. In this paper, we deal with the widely used form of image tampering 
known as image composition(or image splicing). We demonstrate an effective algorithm to detect the spliced images 
based on illumination inconsistencies present in images. An adaptive support vector machine (a- SVM) is used to 
classify the given images as either genuine or forged. 

Keywords — Digital image forensic, forgery detection, image splicing, Adaptive SVM. 
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Abstract — Due to advancement in technology it is easy to modify the digital images and the discovery of modified 
images can be the difficult task as the images are the very powerful source of communication in every field. So, one 
of the major issue in today’s world regarding digital images is the authenticity of given images. Therefore, digital 
image forgery detection is a growing research field with important implication for ensuring the credibility of digital 
images. In this research, we proposed a credible method to detect image splicing based on illuminant color. Artificial 
neural network techniques are implemented as a classifier to detect the tampered images. The results describe that 
artificial neural network is effective to detect tampered images. 

Keywords — Forgery Detection, Image splicing, Illuminant color, Artificial Neural network. 
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Abstract - This paper surveys various possibilities for pattern matching in compressed big data volume. Although 
various compression standards are available for compressing data, entire volume decompression is compelled before 
pattern matching, this in turn leads to increase in computational complexity as well as the space complexity. Some 
compressions algorithms give better compression ratio, at the same time, they are inefficient in decompression 
required for pattern matching. This paper evaluates the possibilities of pattern matching after compression without 
decoding. Also this paper experiments and proposes how the random sampling and its statistics will help to make 
better compression ratio in big data. The another objective of this work is to investigate the possibilities of pattern 
matching in big data without decoding and some of the standards are suggested based on this study and survey. 

Keywords - Compression, Encoding, Decoding, Big data, compression ratio, computational complexity, space 
complexity, random sampling. 
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Abstract — Data grids provide large-scale geographically distributed data resources for data intensive applications. 
These applications handle large data sets that need to be transferred and replicated among different grid sites so 
availability and efficient access are the most important factors affecting the performance. It is obvious that, managing 
the volume of data is very important. Data replication is an important technique to reduces data access time which 
improves the performance of the system by creating identical replicas of data files and distributing them on grid sites. 
In this paper, we propose a novel dynamic data replication strategy called DRPF (Dynamic Replication of Popular 
File), which is based on access history and file’s popularity. As grid sites within a virtual organization(VO) have 
similar interest of files, the basic idea of DRPF is to improve locality in accesses through increasing the the number 
of replicas in the VO. DRPF first selects the popular files that are needed to be copied to other nodes, then tries to find 
the best places for placement of new replicas by taking into account parameters such as the number of demands per 
site for files and bandwidth between replication sites. The algorithm is simulated using a data grid simulator, 
OptorSim. The simulation results show that our proposed algorithm has better performance in comparison with other 
algorithms in terms of job execution time and effective network usage. 


Keywords-Data grid; replication; popular file; placement 
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Abstract — In information security, an image steganography technique uses one of the most popular transforms; either 
a spatial domain or the frequency domain to conceal the secret information. In this paper, an image steganography 
system using the spatial domain technique to conceal secret information in the frequency domain is proposed to 
conceal secret image information in another cover image. The Integer Wavelet Transform (IWT) used to obtain high 
scalable sub bands for each LL, LH, HL and HH of the cover image file. Then, the steganography approach is used to 
conceal the secret information in the wavelet coefficients for all sub bands. The results show high quality of stego 
image, and the stego image is analyzed for different attacks. It is found that the technique is robust, and it can withstand 
the attacks. The quality of the stego image is measured by Peak Signal to Noise Ratio (PSNR), Structural Similarity 
Index Metric (SSIM), and Universal Image Quality Index (UIQI). The quality of extracted secret image is measured 
by Signal to Noise Ratio (SNR) and Squared Pearson Correlation Coefficient (SPCC). 
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Abstract — Managing Alumni System is one of the greatest challenges in the present market of Saudi Arabia. An 
alumni system is a channel between different universities and labor market to deliver various services to students as 
per the merit and priorities. There is no constructive method in present system of Labor office to monitor job requests 
from the students and communicate them with potential changes of market policies. This research aims to provide an 
architecture building a Functional Alumni System in Saudi Universities. The loop holes of current alumni system are 
highlighted and a consolidated methodology is implemented to develop a unique approach for increasing challenges. 
To overcome these deficiencies between Alumni Systems and Labor Market, the preset research provides a runtime 
monitoring system based on Labor policies to attain quality and manageability. The requests placed by students, 
applications executed by labor office and job requests in pending can be monitored and processed with a flexible 
approach by using this method. In turn lot of financial wastage can be avoided by reducing the complexity between 
job seekers and providers by the proposed approach. 

Keywords - Runtime Monitoring, Policy, Alumni System, Saudi Universities, Labor Office, Integration 
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Abstract — Security is one crucial requirement in Wireless Sensor network. To overcome this issue, security protocol 
called Didrip was developed for flat based network which allows for distributed data discovery and dissemination. 
But in terms of clustering approach which is most efficient one in terms of energy conservation, there are lot of security 
vulnerability i.e. checking the cluster head for vulnerability to the network. In addition sensor nodes joining the cluster 
head during user joining phase is also not secure as the nodes can be vulnerable too. These two are most vulnerable 
security issues which are not addressed in existing security protocol of WSN including the one mentioned which is 
Didrip. The above said problems for clustering approach in WSN are overcome with a Cluster-based Certificate 
Authority (CA) scheme which is combination of voting and Nonvoting schemes towards detecting malicious node. 


We also use digital signature to sign all the nodes present in the network. These are simulated using standard network 
simulator ns-2 and results analysed in terms of packet delivery, network life time and energy efficiency. 

Keywords - Didrip, WSN, CA, ns-2 


30. PaperlD 31051696: A Multi-step Method to Calculate the Equilibrium Point of the Continuous Hopfield 
Networks: Application to the Max-stable Problem (pp. 216-221) 

Mohammed El ALAOUI (1), Karim EL MOUTAOUAK1L (2) and Mohamed ETTAOUIL (1) 

(1) Modelling and Scientific Computing Laboratory, University Sidi Mohammed ben Abdellah, Fez, MOROCCO 

(2) National school of applied sciences Al-Hoceima (ENSAH) BP 03, Ajdir Al-Hoceima 

Abstract — The Continuous Hopfield Networks (CHN) is a neural network tools which can be used to solve many 
problems like auto-memory and optimization problems. The dynamics of the CHN is described by differential 
equations system which is hard to solve analytically. That is why, the researchers use the Euler Cauchy method to 
calculate the CHN equilibrium point. Unfortunately, this method suffers from several problems, especially quality of 
the decision for a large step, sensibility to the slope function parameters and to the initial conditions. In this work, we 
use the well-known multi-step numerical method called Adams-Bashforth method, which is strong in terms of stability 
and performance, to calculate the equilibrium point of the CHN associated with the max stable problem. This method 
introduces an intermediary step to improve the Euler Cauchy method precision. The experimental results show that 
the (CHN+ Adams-Bashforth) method produce a large max stable sets in comparison with the (CHN+Euler-Cauchy) 
method. 

Keywords : - Continuous Hopfield Networks, Euler Cauchy method, Adams-Bashforth method, max-stable problem. 
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Abstract — This paper presents the study of an event grouping based algorithm for a university course timetabling 
problem. Several publications which discuss the problem and some approaches for its solution are analyzed. The 
grouping of events in groups with an equal number of events in each group is not applicable to all input data sets. For 
this reason, a universal approach to all possible groupings of events in commensurate in size groups is proposed here. 
Also, an implementation of an algorithm based on this approach is presented. The methodology, conditions and the 
objectives of the experiment are described. The experimental results are analyzed and the ensuing conclusions are 
stated. The future guidelines for further research are formulated. 

Keywords - university course timetabling problem; heuristic; event grouping algorithm 
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Abstract — Watermarking is the concept that provides protection in digital multimedia. This paper uses Discrete 
Wavelet Transform (DWT), Singular Value Decomposition (SVD) and Discrete Cosine Transform (DCT) concept for 
watermarking and extraction purpose. In result analysis we analyze extracted image from watermarked image after 
applying different attacks (like rotation, Gaussian noise, average filter attack, low pass filter, high pass filter, salt and 


pepper, Histogram Equalization etc). We find that this concept is robust against these types of attacks and provide 
high security. 

Keywords- Discrete Cosine Transform (DCT), Discrete Wavelet Transform (DWT), Singular Value Decomposition 
(SVD), Cover Image, Watermark Message. 
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Abstract — Signcrypion is a cryptographic method in which signature and encryption apply on message in a single 
step. On other hand image steganography is a strongest technique for hiding data or information. Therefore 
Communication through insecure channel is challengeable task for an organization. Recently two tier security gain 
popularity because most of the business organizations wants maximum security of data/information. In this paper we 
design a new scheme using cryptographic and stenographic techniques at once on the basis of image steganography 
and elliptic curve cryptography. In proposed design scheme we use both of the steganography as well as cryptography. 
The cryptographic technique encrypts the data by using Elliptic curve cryptography in such a manner that third party 
not understands the original message contents. Stenographic technique is used to hide the text in image and then we 
take hash as well as signature. It also assures the security properties like message confidentiality, message integrity, 
message non repudiation and also message authentication. 

Keywords-component Cryptography, Steganography, Signcrypion, Elliptic curve cryptography. 
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Abstract — Emergency Services Rescue 1122 and Smart Sticker components of our proposed Smart traffic monitoring 
and guidance system model are presented in this paper to provide smart emergency services and to identify vehicles 
to develop advanced transportation system. It involves the Wireless Sensors and actors to communicate with the 
system. The proposed components require fewer resources in terms of sensors and actors. Further, Sensors component 
identifies vehicles through Smart Stickers and it is readable through sensors from its barcode and barcode consists of 
vehicles details in terms of vehicles registration, model, engine and color. Secondly, Emergency Services Rescue 1122 
component provides emergency services as it locates the vehicles through sensors and informs the local authority for 
providing emergency services. Third, violation of rules detects intruders on roads to provide smooth flow of traffic. 
Fourth, to avoid congestion, traffic signals are configured and communicated with sensors to update the system if 
congestion occurs. The proposed components of our model are implemented by developing formal specification using 
VDM-SL. VDM-SL is a formal specification language used for analysis of complex systems. The developed 
specification is validated, verified and analyzed using VDM-SL Toolbox. 
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Abstract — In single cellular networks, the mobile stations cannot communicate directly with each other. All 
communications are relayed through the base stations. Such topology suffers from many limitations such as congestion 
problem when a large number of users are communicating in the same time to a base station. In this context, the 
device-to-device communications have been proposed to overcome the limitations of the conventional cellular 
architecture. Indeed, a mobile station can allow two nearby stations to communicate with each other without involving 
a base station. However, security becomes an important challenge that must be taken into consideration as the mobile 
stations participate in routing data between each other. In this paper, we propose a secure routing protocol for Multi- 
hop Cellular Networks (MCNs). Our goal is to discover a secure and short route between the source and the 
destination. To evaluate this proposed protocol, we perform some simulations using Network Simulator (NS-2). The 
simulation results show that it provides acceptable performance in terms of throughput and routing overhead as 
comparing with Secure Ad hoc on demand Distance Vector (SAODV). 

Keywords-component; single cellular networks, base stations, Device-to-device, secure routing protocol, MCNs, NS- 
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Abstract - Investigation on privacy preserving data mining is in extensive need to the present day technological 
situation. Storage of the data and its usage through various computational processes is becoming very easy and 
efficient. At the other end the primary concern or sometimes can be termed as limitation to this extensive data analysis 
is privacy. There are existing privacy preserving techniques that solve this problem and also guarantee privacy as well 
as data utility. But these techniques have to be updated in parallel to the expansion of digital technology. In view of 
this, the part of research in this paper analyses various normalization techniques with heterogeneous data distortion. 
The experimental consideration is done with the comparison of various statistical measures on the distorted data and 
their preservation with respect to the original data. We evaluated the performance of heterogeneous data distortion 
with three types of transformations namely Min-Max Normalization, Z-Score Normalization and Decimal Scaling. 
The performance is evaluated with various data distortion measures and privacy measures. 

Keywords: Privacy Preserving Data Mining (PPDM), Data Normalization, Privacy, Data utility. 
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Abstract — There is a correlation between pixels in each image so that each pixel value of adjacent pixels can be 
guessed. By removing these dependencies can be compressed images. Our goal is to reduce the amount of compressed 
image data needed to display the digital images and therefore reduce the cost of transmission and storage. Compression 
has a key role in many important applications. These applications include image database, transmission of images, 
remote sensing, medical imaging, military and space equipment remote control and so on. In addition to the 
compression, image coding, there's talk. That after quantization matrix should be coded range of conversions. In 
reconstruction after decoding to achieve our desired image obtained with the difference that the picture is far less than 
the original image. What we've done in this thesis using a fractal method utilizes a Kohonen neural networks and 
clustering to increase the compression ratio and reduction coding and decoding the image. We have implemented three 
methods based on fractal coding. The first method is simple fractal coding. In the second method to create the 
codebook of multiple tree fractal coding is used. In the second method of vector quantization LBG algorithm for 


Kohonen neural network-based clustering algorithm and code book for coding image is used. Results in the second 
method show faster encoding. The method is simple fractal compression rate is higher than other methods. 

Keyword: image compression; clustering; vector quantization 


38. PaperlD 310516122: A Joint Duty Cycle and Optimal Energy Adaptation Algorithm for the Body Area 
Sensor Networks (pp. 269-274) 

Ali Raza, Dept, of computer science, City University of Science & Information Technology, Peshawar, Pakistan 
Arshad Farhad, Dept, of computer science, COMSATS, Sahiwal, Pakistan 
Wajid Ullah Khan, Dept, of computing, Abasyn University, Peshawar, Pakistan 

Muhammad Arif, Dept, of computer science, City University of Science & Information Technology, Peshawar, 
Pakistan 

Abstract — IEEE 802.15.4 standard is widely adapted for Body Area Sensor Networks (BANs) due to its low duty 
cycle and low power operation. However, IEEE 802.15.4 recommends the use of fixed duty cycle operation which 
results in high energy consumption and end-to-end delay. Therefore, an efficient algorithm is needed to adapt duty 
cycle operation to overcome the end-to-end delay and energy consumption. In this paper, we propose a Joint Duty 
Cycle algorithm (JDCA) for the BAN to enhance the network lifetime, throughput and decrease the end-to-end delay. 
Dynamic duty cycle can be adapted by the two MAC parameters: Beacon Order (BO) and Super frame Order (SO). 
However, these parameters are set by the network administrator before the network deployment. During simulation, 
JDCA algorithm is capable of adapting dynamic duty cycle at run time based on traffic load. Furthermore, simulation 
results shows enhanced network lifetime, network throughput and less end-to-end delay when compared with IEEE 
802.15.4. 

Index Terms — Dynamic duty cycle, IEEE 802.15.4, Body area sensor networks, Wireless personal area network. 
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Abstract — This paper presents the evaluation performance of broadband hybrid satellite constellation communication 
system (BHSCCS) networks which provides high performance data transfer in grid network environment based on 
TCP protocols. The evaluated hybrid satellite network uses the COMMStellationTM constellation topology on lower 
orbital. We adopt the GridFTP to improve network performance. GridFTP is a high-performance, reliable data transfer 
protocol optimized for high-speed Internet to suitable WAN networks. The simulation results show the network 
performance of GridFTP which different AQMs, TCPs, PERs, over BHSCCS networks. 

Keywords: COMMStellationTM; GridFTP; Hybrid Satellite; Queue; TCP 
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Abstract — Wavelet Neural Network (WNN) is attracting interest in field of classification system, because they are 
universal approximations, particularly due to rapid and accurate representation of nonlinear dynamic systems. The 
satisfying performance of the WNN depends on an appropriate determination of the Wavelet Neural Network 
structure. In this paper we provide a new method to solve this problem based on the Least Absolute Shrinkage and 
Selection Operator (LASSO). At first, the scale of WNN is managed by using the time-frequency locality of wavelet. 
Furthermore, the unconstrained optimization problem (LASSO) is used to solve the structure and learning of the 
WNN. This optimization problem can be solved efficiently using the iteratively reweighted least squares (IRLS) and 
the Least Trimmed Square (LTS) methods to enhance the ineffectiveness; they are applied to train the wavelet neural 
network. The advantage of the method lies in the oracle properly of the LASSO can guarantee the optimal structure 
of the WNN. The proposed method has been able to optimize the wavelet neural network and this method is able to 
classify the DNA sequences. Our goal is to construct predictive models that are highly accurate. In fact, the proposed 
method permits to avoid the complex problem of form and structure in different clusters of organisms. The empirical 
results and their classification performances are compared with other methods. We compared the WNN-Lasso model 
with the other five alignment-free models, i.e., k-tuple, DMK, TSM, AMI, and CV, on several large-scale DNA 
datasets on the DNA classifying application by means of the K-means method. The experimental results have shown 
that the WNN-Lasso model outperformed the other models in terms of both the classifying results and the running 
time. Evenly, in this study, we present our approach consists of three phases. The first one, which is called 
transformation, is composed of two sub steps; binary codification of the DNA sequences and the Signal Processing of 
the DNA sequences. The second phase step is the approximation; it is empowered by the use of the Multi Library 
Wavelet Neural Networks (ML WNN). Finally, the third section, which is the classification of the DNA sequences, is 
realized by applying the algorithm of k-means classification. 

Index Terms — LASSO, LTS, Wavelet Neural Networks, DNA sequences, ML WNN, IRLS. 
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Abstract- Morphological analysis is the process of constructing and deconstructing the words of a language, the 
process is based on the basic grammatical units which are stem, prefixes, suffixes and infixes. Sindhi is rich in 
morphological features with a great variety of affixes. The problem for Sindhi to come into computerization is the 
large number of variants in its morphology. This complexity is created due to different positions of prefixes, suffixes 
and stems in the words. The automatic word segmentation system normally faces such embedded hurdles in Sindhi 
language. An algorithm is required with a capability of dealing with such issues for the segmentation of Sindhi words. 
In this paper, an algorithm is designed and implemented to resolve the problem of segmenting Sindhi complex and 
compound words into possible morphemes. The developed words segmentation system has been tested on a list of 
109 compound words, 179 prefix words, 1343 suffix words and 50 prefix-suffix words. The cumulative segmentation 
error rate of 5.02% is calculated. This system can also be used as pre-requisite in various Sindhi language and speech 
processing applications. 

Keywords — Sindhi Morphology; Morphological Analysis; Word Segmentation; Morphemes 
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Abstract — Most of the existing secret sharing schemes are based on polynomial interpolation. In other word, they 
use polynomial functions in their schemes. In this paper, we solve the problem of creating a secret sharing scheme 
based on rational interpolations. We show that if * support points have the same width then the rational interpolation 
of the support points, which is called ( )( ), has pole points. Finally, we give an example for the accuracy of the 
proposed scheme. 

Keywords -component; Secret Sharing Scheme; Shamir’s Scheme; Polynomial Interpolation; Rational Interpolation, 
Pole Points. 
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Abstract — Although there are various biometric techniques, like fingerprints, iris scan as well as hand geometry, the 
most efficient and widely-used one is face recognition because it is inexpensive, non-intrusive and natural. In our 
paper, we present an approach aiming at implementing a full architecture which represents an efficient system of face 
recognition. For this, an attempt is proposed for each system stage. At the beginning, we develop a novel approach to 
detect faces existing in 2D color image. This approach focuses mainly on how to implement a selection of skin color 
before using neural networks and Gabor filters. This approach represents an improvement of existing approach 
especially because it aims to minimize the computation time. Indeed, the skin detection step avoids wrong detection 
and to help the system detect the face in the right areas and minimize the research time and subsequently the Gabor 
filter will be applied only on the localized skin space. Later, the face features obtained by the Gabor filter represent 
the input of the neural network classifier to decide whether an input image pixel is a face pixel or not. For 2D face 
recognition, we propose likewise a novel approach that we call HMMLBP (a combination of the two tools Hidden 
Markov Models HMM and Local Binary Pattern LBP). It allows classifying a given 2D face image through utilizing 
an LBP tool to extract features. In order to validate our whole system performance, we show experimental results 
obtained when applying our proposed algorithm on benchmark face databases, respectively AT&T, Yale and Feret. 
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Abstract — Cloud computing gaining popularity at enormous rate since from its emergence. CC changed the way that 
computing services are provided. On demand platform (PaaS), infrastructure as a service (Iaas) and software (SaaS) 
as a service through internet. Consumer use third party services instead of building his own infrastructure which need 
up-front investment and expertise. Cloud computing becoming popular for unlimited computing power, availability, 
nice pricing, on demand services and quality of service. For availability and computing power the service provider 
expands their resource capacity to handle user requirements. This expansion in resources capacity lead to high energy 
demand. Two big issues for cloud computing is energy demand and security/privacy requirements. In this survey we 
will give a review on the latest techniques for energy efficiency in cloud computing. The main focus is on software 
base energy efficiency techniques in which we will explain the workload consolidation and resource management in 
detail. 

Index Terms — cloud computing, data center, energy efficiency techniques. 
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Abstract — Cloud computing provides distributed resources to the users globally. Cloud computing contains a scalable 
architecture which provides on-demand services to the organizations in different domains. However, there are multiple 
challenges exists in the cloud services. Different techniques has been proposed for different kind of challenges exists 
in the cloud services. This paper reviews the different models proposed for SLA in cloud computing, to overcome on 
the challenges exists in SLA. Challenges related to Performance, Customer Level Satisfaction, Security, Profit and 
SLA Violation. We discuss SLA architecture in cloud computing. Then we discuss existing models proposed for SLA 
in different cloud service models like SaaS, PaaS and IaaS. In next section, we discuss the advantages and limitations 
of current models with the help of tables. In the last section, we summarize and provide conclusion. 

Index Terms — Service Level Agreement (SLA), Cloud Computing. 
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Abstract — 3D mesh is a new data type appeared in the last decades. Since its emergence, it has been used in several 
areas which raise major security problems. As a solution, we propose a blind watermarking algorithm for 3D meshes. 
For doing spiral scanning method decomposes the mesh into GOTs (a Group of Triangles). At each time, only one 
GOT will be uploaded into memory. It undergoes a wavelet transform to generate vector of wavelet coefficients. This 
latter undergoes modulation then embedding steps using data coded with BCH code. Once watermarked, the next 
GOT will be uploaded. This process stopped when the entire mesh is watermarked. Experimental tests show that the 
quality of meshes is kept despite the high insertion rate and that memory consumption is reduced. As for robustness, 
our algorithm overcomes the following attacks: translation, rotation, smoothing, uniform scaling, coordinate 
quantization, noise addition, simplification and compression. 

Index Terms — Digital watermarking, 3D meshes, Multiresolution, Wavelet transform, Spiral scanning, Attacks, 
Compression. 
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Abstract — Internet of Things (IoT) is an emerging technology which is covering everyday things from industrial 
machinery to consumer goods in order to exchange information and complete tasks while involved in other work. IoT 
based smart home automation system is a system that uses PCs, mobile phones or remote devices to control basic 
operations for home automatically from anyplace around the world using internet. The proposed intelligent home 
automation system differs from existing systems as it allows the user to operate the system from anywhere around the 
world by using internet connection along with intelligent nodes that can take decisions according to the environmental 
conditions. We implemented a home automation system using sensor nodes that are directly connected to Arduino 
microcontrollers. Microcontroller is programmed so that it can perform some basic operations on the basis of sensors 
data. e.g. fan is controlled on basis of temperature value and light is controlled on the basis of occurrence of motion 
in the room etc. Furthermore Arduino board is connected to the internet using Wi-Fi module. An extra feature this 
system provides is to monitor power consumption of different home appliances. The designed system provides the 


user remote control of numerous appliances locally as well as outside the home. This designed system is expandable, 
allowing multiple devices to be controlled. The objective of the proposed system is to provide a low cost and efficient 
solution for home automation system by using IoT. Results show that the proposed system is able to handle all 
controlling and monitoring of home. 

Keywords — Internet of Things (IoT), Wireless Sensor Network, Home Automation System, Energy Monitoring. 


48. PaperlD 310516142: A Threshold-Based Predictive Scheme for Mobile Subscribers in Publish/Subscribe 
Systems (pp. 351-357) 

Fatma Abdennadher, Maher Ben Jemaa 

National School of Engineers ofSfax, University ofSfax, ReD CAD Laboratory, B. P.117 3, 3038 Sfax, Tunisia 

Abstract — In this paper, we present our strategy adopted to deal with the mobility into publish/subscribe. Specifically, 
we focus on the management of the mobile users from one broker to another. In fact, the topic of mobility into 
publish/subscribe systems may cause many problems such as the increasing of the traffic into the network and the 
messages loss. To overcome these problems, we have created a selective scheme on the basis of an accurate selection. 
In fact, a threshold value is devoted to be the criterion for the selection of caching points. On the basis of this principle, 
we apply various network settings to explore the effectiveness of our approach. Hence, we extract the improvement 
of our approach on the messages loss, the caching cost and the propagation cost in function of buffer size, publication 
rate, period of disconnection and connect time. 

Keywords-Distributed Networks; Mobile Computing; Publish/Subscribe; Prediction Management; Performance 
Efficiency. 
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Abstract — Intelligent Transportation Systems are defined as those systems utilizing synergistic technologies and 
systems engineering concepts to develop and improve transportation systems of all kinds. Vehicular Ad-hoc Network 
(VANETs) which is an application of Mobile Ad-hoc Networks (MANETs) play an important role in ITS and emerged 
to provide Vehicle to Vehicle, Vehicle to Roadside and Vehicle to Infrastructure communications, aiming to improve 
safety on roads, exchange data between vehicles and provide different services to the users. According to special 
characteristics of VANETs like bandwidth limitation, high mobility, signal fading and real-time data communications, 
QoS provisioning in these networks is a challenging task. In this paper, we introduce an architecture for vehicular 
networks and a protocol stack which aims to reduce the processing overhead, make routing easier and provide Quality 
of Service in vehicular networks. Finally, after designing protocols and headers of the mentioned protocol stack, we 
will simulate our proposed idea in a vehicular environment and after simulation process, we will compare the achieved 
results with another scenario in which regular TCP/IP protocols are used. 

Keywords -component; VANETs; ITS; QoS; Protocol Stack 
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Abstract — Transition from IPv4 to IPv6 is a cumbrous process because of their irreconcilability with each other and 
coexists during the transition period. This work examines the behavior of transition mechanisms that involve 
communication among IPv4 and IPv6 in various scenarios and traffic conditions. A network analyst faces variable 
traffic and data rates at different nodes in such a heterogeneous network, that requires more attention to make it able 
to work with stable network flow and data rate. We analyse an end-to-end delay of VOIP data packets in IPv4 and 
IPv6 homogeneous and heterogeneous networks using 6 to 4 tunneling techniques. This work shows that IPv6 has 
better performance than IPv4 and IPv6-to-IPv4 tunneling. The tunneling technique improves the network throughput 
and queuing delay over the intermediate nodes of the heterogeneous network. 

Keywords: IPv4, IPv6, VoIP, 6- to-4 tunneling, DSTM 
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Abstract — Today as Android is used by majority of the smartphone users it has become one of the effortless platform 
for the malware-writers to introduce their malicious activities into smartphone world through this android mobile 
applications. The main loophole in Android applications is permission based security control. The User unawareness 
of accepting every permission as a mandatory requirement by an app is making more and more convenient for the 
hackers to extract the users’ private data. In this paper we have analysed all the leakages which are done by using 
permissions required by an app. We carefully made an investigation to detect collusion attacks .We analyzed the 
present detection methods of inter-permission leaks especially on Collusion attacks and mentioned the areas where 
the enhancements are needed with limitations that existed in present detection methods. 

Keywords - Collusion attacks, inter-permission leaks 
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Abstract — Requirements elicitation is the first and the most critical phase of Requirements Engineering (RE). Many 
techniques have been proposed to support the elicitation process. Each technique has its strengths and weaknesses. 
This variety makes the selection of technique or combination of techniques for a specific project a difficult task. 
Mostly techniques are selected based on personal preferences rather than on attributes of project, technique, and 
stakeholders. In this paper, the researchers propose a three-component approach for elicitation techniques selection. 
First, a literature review is conducted to identify the attributes affecting techniques selection and common elicitation 
techniques. Second, a multiple regression model is built to analyze these attributes in order to find the critical attributes 
influencing techniques selection. Finally, an Artificial Neural Network (ANN) based model for selecting adequate 
elicitation techniques for a given project is proposed. The ANN model helps reduce the human involvements in this 
process. It was implemented using Neural Network Fitting Tool in MATLAB. The network has accuracy of 81%. The 
ANN model was empirically validated by conducting a case study in a software company. 


Keywords: Requirements Engineering, Requirements Elicitation, Multiple Regression Analysis, Neural Network. 
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Abstract — Proxy Re-Encryption has been used since the need for forwarding an encrypted message to a party for 
whom it was not encrypted was highlighted in the form of delegation rights by Blaise, Bleumer and Strauss. Various 
Proxy Re-Encryption schemes have been introduced till today mainly focusing on demonstrating features like 
transitivity and collusion-resistance to ensure minimal trust on the proxy and maximum key-privacy. This survey 
highlights some major schemes introduced, classifies them based on their directionality, brings to light their major 
advantages and disadvantages, and provides a detailed comparative study based on the key features a Proxy Re- 
Encryption Scheme must possess in order for its widespread. 

Index words — bilinear maps, CCA secure, collusion resistance, CPA secure, delegation rights, Deffie-Hellman key 
exchange, DBDH assumptions, Proxy Re-Encryption; transitivity. 
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Abstract — WSN is an evolving technology since last ten years. As wireless nodes work have less power supply in 
the form of a battery, it is necessary for the nodes to work for maximum time. Different techniques are adopted to 
achieve better energy optimization. This paper presents a survey on energy efficient routing techniques, which will 
help in understanding the factors which affect energy efficiency and other performance parameters and will help to 
analyse the techniques for further optimizations. 

Index Terms — Wireless Sensor Networks, Energy optimization, Topology. 
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Abstract — Face partitioning technique is presented in this paper. Instead of directly giving the face to the face 
recognition system, first the face is partitioned in to different face parts using face partitioning technique. The face 
parts are namely mouth, left eye, right eye, head, eye pair and nose. Eigen and Fisher features based algorithms are 
considered for experimental purpose. These face part features are given to the SVD classifiers individually. The 
outputs of the classifiers are again given to the decision making algorithm. Based on the maximum likely hood 
principle, this decision making algorithm outputs a face. ORL data base is used for evaluating the performance of this 
new technique. The first two faces of all the 40 people in the data base are considered for testing and the remaining 
eight faces are used for training purpose. Results are separately calculated with and without face partitioning 
technique. Results show that face recognition rate is increased by using the combination of face partitioning technique 
and basic face recognition algorithm. The new algorithm is also verified on 8 different data sets. Experimental results 
show that this face partitioning is improving the face recognition rate both Eigen and Fisher feature based algorithms. 


Index Terms— Face Partitioning, Facial features, Recognition engine, Support Vector Machine, Decision making 
algorithm. 
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Abstract — Software as a service (SaaS) is a Cloud Computing service model that exploits economies of scale for 
SaaS service providers by offering a single configurable software and computing environment for multiple tenants. 
This contemporary multi-tenant service requires a multi-tenant database that accommodates data for multiple tenants 
using a single database schema. In general, traditional Relational Database Management Systems (RDBMS) do not 
support multi-tenancy and require schema extensions to provide multi-tenant capabilities. This paper proposes a multi- 
tenant database schema called Elastic Extension Tables (EET), which is highly flexible in enabling the creation of 
database schemas for multiple tenants by extending a preexisting business domain database, or by creating tenant 
business domain database from the scratch at runtime. The empirical results presented in this paper indicate that the 
EET schema has potential to be used for implementing multi-tenant databases for multi-tenant SaaS applications. 

Index Terms — Cloud Computing, Software as a Service, Multi-tenancy, Elastic Extension Tables, Multi-tenant 
Database. 
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Abstract — The availability of network services are being menaced by the increasing number of Denial-of-Service 
(DoS) attacks. The availability of such interconnected systems is severally degraded by increasing number of DOS 
attacks. Denial-of-Service (DoS) attacks cause serious impact on these computing systems such as router, host or 
entire network. DoS attack detected using Multivariate Correlation Analysis (MCA) technique. Multivariate 
correlation analysis employs for accurate network traffic characterization by extracting the geometrical correlations 
between network traffic features. The proposed system uses the Multivariate Correlation Analysis (MCA) technique 
for accurate characterization also uses the anomaly based detection technique in attack recognition. Anomaly based 
detection makes system capable of detecting seen and unseen attacks. Moreover, a triangle area based technique is 
planned to reinforce and increases performance of MCA. The impact of each non-normalized information and 
normalized information on the performance of the proposed detection system is tested. 

Keywords — Denial- of- Service attack, network traffic characterization, multivariate correlations, triangle area. 
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Abstract - SQL Injection vulnerability takes advantages of the poorly coded web application and exploits the sensitive 
and critical information stored in an application’s database by compromising the authentication logic of the database 
server. In Most of the web applications user inputs in the dynamic web pages are the vulnerable points for SQL 


injection attack. A Single detection tool cannot handle the sophisticated injection attacks by the intelligent hackers. 
The proposed hybrid model with SQLI-Rejuvenator on an Application Program Interface is tested and proved as an 
efficient technique to detect and prevent SQL injection. In this architecture, the malicious queries are blocked and an 
alert message is generated if the injection is detected. Only the benign query is allowed to access the data from the 
backend database server. The Unique identity created by the template creator application, the Rejuvenator module and 
evaluation engine are significant features of the proposed model to prevent the Injection attack and can facilitate better 
availability of the application. 

Keywords - Authentication; Injection; Vulnerability; Hackers; Detection; Rejuvenation; 
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Abstract - In this article, we will propose a real-time human hand gesture recognition system which will perform 
translations from the sign language to the common French language. The processes is composed by three basic steps: 
The detection and extraction of the hand pattern characteristics during the image stream acquisition, which is obtained 
from an integrated camera. The analysis process, in which the obtained characteristics are classified as either a 
recognized sign language gesture or an unclassified hand movement. Preset characteristics of each effective hand 
gesture are stored locally. The message-assembling phase: at the end of cycle of each iteration of the two previous 
steps, the obtained result is either neglected or concatenated with the assembled message so far. The message is then 
displayed. 

Keywords: human-machine communication, gestural interaction, French sign language, linked gesture recognition. 
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Abstract - In this paper, we have proposed a robust technique to detect and classify the tumour part from medical brain 
images. In recent times, a number of image segmentation and detections techniques have been proposed in the 
literature. But, the detection of brain tumour through the help of classification technique has received significant 
interest among the research community. By considering the above issue, here, we combine three different techniques 
such as, cuckoo search, neural network and fuzzy classifier to detect the tumour part effectively. Our proposed 
approach consists of four phases, such as, pre-processing, region segmentation, feature extraction and classification. 
In the pre-processing phase, the anisotropic filter is used for reducing the noise and in the segmentation process; K- 
means clustering technique is applied. For the feature extraction, the parameters such as contrast, energy and gain are 
extracted. In classification, a modified technique called Cuckoo-Neuro Fuzzy (CNF) algorithm is developed and 
applied to detection of tumour region. In the modified algorithm, cuckoo search algorithm is employed for training 
the neural network and the fuzzy rules are generated according to the weights of the training sets. Then, classification 
is done based on the fuzzy rules generated. Experimental results shows that the proposed technique achieved the 
accuracy of 79.49% but existing technique achieved only 76.92%. 

Keywords: CNF, contrast, energy, entropy, K-Means, anisotropic filter, sensitivity, specificity, accuracy 


61. PaperlD 310516183: Permission Based Android Malware Detection System using Machine Learning 
Approach (pp. 465-470) 


Mayuri Magdum, Computer Engineering, Modern Education Society ’s College of Engineering, Pune, India 
Prof Sharmila K. Wagh, Computer Engineering, Modern Education Society's College of Engineering, 

Pune, India 

Abstract — Mobile computing has grown and developed in recent years with huge popularity. Gadgets like Smart 
phones, Tablets, etc have become trendy by the ease of use. Android is more famous platform and turned out to be the 
most important target of Malware developers in precedent years. The malware hazard for cellular telephones is 
evaluated to increment security and usefulness of smartphones. Hackers and malware program developers are 
benefitted by the limited capabilities and lack of standard security mechanism of Android. Nowadays smart phones 
are omnipresent, i.e. they fill numerous needs such as data storage, personal mobile communication, multimedia and 
entertainment etc. therefore, implementing secure mobile connections is challenging. As a result, it becomes essential 
to have some valuable and probabilistic detection along with preventive mechanisms. Many preventive tools are 
available in market but current trend for malware security is before installing the app user should be able to identify 
possible threats. Hence we propose permission based mobile malware detection system. It has 3 components in it 1) 
Client 2) Server 3) Signature Database. In the whole analysis process, Server plays important role and user is warned 
at the end of analysis process whether the requested app contains malware or not. 

Keywords- Mobile, Android, Malware, Security, Machine Learning, Static Analysis. 
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Abstract — Increasing amount of dependability on computer networks and internet services are also increasing 
intrusions. Intrusion Detection System (IDS) tools detect the intrusions and produce alerts. An automated Intrusion 
Response System (AIRS) is required to analyze the alert and trigger appropriate response to mitigate the intrusion 
without delay. In this paper, cost evaluation methods and response decision making capabilities of various AIRS 
models are analyzed. Various decision making factors that are involved in the response selection process are also 
identified and then categorized in response, attack and system level factors. 

Index Terms — Intrusion Response System, AIRS, Response selection, Response factors, Response cost. 
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Abstract — SQL Injection Attack (SQLIA) is a technique of code injection, used to attack data driven applications 
especially front end web applications, in which heinous SQL statements are inserted (injected) into an entry field, web 
URL, or web request for execution. “Query Dictionary Based Mechanism” which help detection of malicious SQL 
statements by storing a small pattern of each application query in an application on a unique document, file, or table 
with a small size, secure manner, and high performance. This mechanism plays an effective manner for detecting and 
preventing of SQL Injection Attack (SQLIA), without impact of application functions and performance on executing 
and retrieving data. In this paper we proposed a solution for detecting and preventing SQLIAs by using Query 
Dictionary Based Mechanism. 

Index Terms — SQL Injection Attack, SQL Injection Attack Detection, SQL Injection Attack Prevention, Query 
Dictionary. 
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Abstract — Most of intrusion detection researches suffer from the following drawbacks: Dependencies between 
network nodes and cluster-like behavior of anomalies. Hence, this paper proposes a cluster-based approach in which 
the anomalies are detected using a new criterion related to the behavior of attacks. In addition, we provide a cluster- 
based data set which uses the flow-based data and graph properties to model the network traffic over time. The data 
set is built over the DARPA. Moreover, the anomalies are revealed by means of a criterion which is computed from 
internal and external weight of clusters. Finally, the proposed approach is evaluated and compared to other approaches. 
The evaluation results show the preference of our approach relative to other ones. 

Keywords- Anomaly; DARPA data set; flow; graph clustering; intrusion detection 
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Abstract — Determining a vehicle’s trajectory is a complex and hard to solve type problem in the literature and it is 
identified as a NP-Hard optimization problem which is studied in different engineering disciplines such as computer, 
electrical and industrial engineering. It has been observed that such complex problems can be solved by using various 
approaches and lots of them are focused on the usage of Evolutionary Algorithms especially in case of a large number 
of controls points which are needed to be visited. Although these algorithms provide near optimal solutions, in the 
real world, vehicles are not able to follow this determined path (trajectory) without any deviation. Because vehicles 
are moving objects and each one moves with a certain speed. Therefore it is impossible for a vehicle to make a sharp 
turn after visiting control points. These vehicles need to make smoothed turns over these points. Therefore there will 
be a certain difference between the calculated path and the real path. It is needed to determine the real path by using 
necessary mathematical solutions for smoothing these paths. To ensure the motion continuity of vehicles, they need 
to follow paths determined according to a certain criterion. In this study, the most common smoothing methods which 
are used to ensure these continuities (Bezier, B-Spline and Dubins) have been compared and it is aimed to show the 
different approaches in an application area of path planning problems as a comparative study. 

Keywords — Unmanned Aerial Vehicle, Path Planning Evolutionary Algorithm, Bezier Curves; B-Spline Curves, 
Dubins Path. 
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Abstract - Since the last two centuries, humanity has made scale steps in this attraction to innovation and technological 
progress. The emergence of global networks of computers corresponding to Wireless Sensor network WSN is one of 
those great steps that man could do. WSN is an advanced technology that occur in response to overcome user needs. 
It resolves many problem such as, controlling phenomena, monitoring places, and diagnostic. Nevertheless, this 


advanced technology still incomplete in order to different constraints such as energy consumption, routing, aggregated 
data and security, also routing information represents a critical issue in it. For that, great researches designed. In this 
paper, we present a survey of GAF and their enhanced versions as Location-Based routing protocols in WSN, which 
allows reducing the consumed energy in the network and prolonging the network lifetime. 

Keywords: WSN, routingprotocols, location-based, GAF. 
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Abstract - Cryptography is a very useful tool to protect the properties of data like integrity, privacy, confidentiality in 
any environment. This paper explores some useful aspects of cryptography in cloud computing environment. There 
are different types of encryption algorithms used in order to ensure the data security. These algorithms are of different 
types like symmetric, a symmetric and hashing algorithms. The objective of this paper is performance analysis of 
selected set of algorithms on the basis of different parameters, so that the best out of all these options is chosen or 
combinations of some of them can be utilized to secure data in cloud computing environment. The algorithms included 
in this study are RC2 and AES. The parameters which are used for performance analysis are running time of the 
algorithm, data encryption capacity. These are the performance parameters which are calculated for every algorithm 
in cloud based environment i.e. windows azure simulator by utilizing visual studio IDE and profiler services by 
integrating windows azure SDK. The interpretation of these results are done by using various graphs which shows 
trend of a particular algorithms on basis of time of encryption and decryption. 

Keywords: Cryptography, Cloud Security, RC2, AES, Windows Azure 
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Abstract — Due to continuous evolution in hand handled mobile devices such as Smartphones, Laptops, tablets and 
Personal Digital Assistants (PDAs) have increases the volume of traffic on Internet radically. To provide seamless 
Internet services and perpetual mobility to these devices, Internet Engineering Task Force (IETF) has proposed various 
mobility management protocols such as MIPv6, HMIPv6, and PMIPv6. MIPv6 is a host-based mobility management 
protocol and suffers from handover latency, packet loss etc. Recently the IETF proposed network-based mobility 
management protocol, known as Proxy Mobile IPv6 (PMIPv6). PMIPv6 sufficiently reduces signaling overhead but 
still have long authentication latency during handover and packet loss issues. To resolve these issues, an optimized 
and secure authentication mechanism for handover management scheme for PMIPv6 networks is proposed in this 
paper. Due to less authentication delay, the proposed scheme reduces the setup time and as a result has low handover 
latency. Subsequently, decreases the amount of packet loss during handover. The proposed scheme provides higher 
security infrastructure than the basic PMIPv6 protocol and additionally reduces the handover latency to contemporary 
protocols. The performance and results are mathematically analyzed. Numerical results show that the proposed scheme 


gives better performance than the existing MIPv6 in terms of signaling delay and provide higher security than PMIPv6 
protocol. 
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Abstract — Due to mass global migration and increased usage of the Internet, it is now very important to address the 
cultural aspects of the usability problems of any Information and Communication Technology (ICT) products such as 
software, websites or applications (apps) whether to be used on PCs, Laptops, Smartphones, Tablets, Smart TVs or 
any other devices. To augment the “Design for All” concept, this research demonstrates the need to cater for culturally 
diverse users while designing user interfaces. This has been achieved, by investigating ICT products and conducting 
an extensive literature survey. The study concludes that it is very important to work on cross-cultural usability 
problems and bring these issues under focus. 

Index Terms — Human Computer Interaction (HCI), Universal Usability, Cross-cultural Usability, User Interface 
(UI) Design, Design for All, Users’ Behaviour. 
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Abstract - Over the years road traffic flow has seen pedestrian crossing as a major issue in the society, particularly in 
urban areas where there is no control for pedestrian road crossing. In mixed traffic conditions pedestrian road crossing 
behavior is a serious hazard for pedestrians crossing uncontrolled bi-intersection localities. Due to increase in motor 
vehicle growth there is an increase in the regulation of motor vehicles only and the regulation of pedestrian is 
completely neglected in urban area. An increase the uncontrolled road crossing behavior of pedestrian is raises 
different safety and economic concerns. This paper employs computational modeling to regulate the traffic flow across 
a two way intersection. It is caters how pedestrians can cross a bi-intersection traffic signal without disrupting the 
traffic flow. Existing computational models that have been presented by other authors are discussed which gives more 
understanding how to control traffic flow for vehicles and pedestrians handling. This study deals three scenarios of 
real environment for control of traffic flow for pedestrians; with no turns, with turns and with turns. All scenarios 
provides proper notation for ‘on states’ and ‘off states’ of signal. Experimental result demonstrates that the proposed 
method achieved waiting time for vehicles 143.35 seconds and 200.23 seconds for pedestrians respectively. 
Furthermore, result shows the decrement of time and economical resources that are used in the daily commute. 

Index Terms — Pedestrian, Bi-intersection, uncontrolled traffic, Computational Modeling, Traffic Control System 
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Abstract - In communication networks, the data encryption has been used to safe the security of information. There 
are different encryption techniques that can be used to protect the data from unauthorized third person to access. This 
paper deals with chaos image encryption environment to hide the secret information and make communication 
undetectable. In this paper integer wavelet transform (IWT) and discrete cosine transform are used for increasing 


hiding pixel distribution. The work uses IWT and DCT as a decorrelation stage for adjacent pixels. The performance 
evaluation for the proposed algorithm has been done by measuring the application using a series of tests. The tests 
include histogram analysis and visual test, correlation analysis encryption quality, information entropy, randomness 
test, sensitivity analysis and differential analysis. The proposed cipher algorithm experimental results show 
satisfactory security and efficiency levels for image encryption. 

Keywords: Chaotic Encryption; AES; RC4; Statistical Analysis 
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Abstract - In this paper, Multi-Objective Inclined Planes Optimization (MOIPO) algorithm, as a novel multi-objective 
technique, is used to design ensemble classifiers with high reliability and high diversity. It is noteworthy that 
sometimes, the reliability in decision of a classifier is more important than its recognition rate. Security and military 
applications are obvious instances to show the importance of this measure. In addition to reliability, diversity, as a 
main issue in ensemble classifiers, is considered as objective function. So, designing heuristic ensemble classifiers 
with high reliability and also, high diversity has a special importance but the basic point is that the applied heuristic 
algorithm has a stochastic nature and hence, stability analysis of this system is necessary. In this research, statistical 
method is used to do stability analysis of designed ensemble classifier. 
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Abstract — Kalman filter is a very effective approach for data fusion. But, the definition of process, measurement 
noises, and the matrices Q, R have a great impact on the filter performance. Research works show that adjustment of 
matrices Q, R during the prediction process is very useful to reduce the estimation errors. So, in this paper, we attempt 
to increase the accuracy of Kalman filter used in INS/GPS integration algorithm by estimating measurement 
covariance matrix, R, based on measurement data from GPS. Our objective is to show a performance enhancement of 
a conventional extended Kalman filter used in an INS/GPS integrated navigation system by adjusting adaptively 
measurement noise covariance matrix R. This adaptive adjustment is necessary. Because, environment conditions in 
many systems usually are not constant and change continually. 

Index Terms — Integrated navigation, Extended Kalman filter, Adaptive Kalman filter 
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Abstract - Multimedia has become part of our day today life especially when it comes as images. Many studies have 
proved that images are the most efficient way of expressing our feelings rather than a page of paragraphs. An example 
we can state here is the smileys we use in our messages for expressing our thoughts. The ultimate rise of social websites 
like Google+, Twitter and Facebook, playing major role in the Internet World has proved it wright since these websites 
are rich in content and huge number of images shared. The revolutionary technology development in the mobile 
industry is also playing the major role in using such multimedia content. Since the images are being shared in different 


ways, people start compressing the images to reduce the huge amount of memory space. This compression leads to 
data loss (pixel) in images which affects the quality of the images. Many solutions have been identified to solve the 
issues. One such system uses one dimensional approach in all four directions (Row, Column, Diagonal and Inverse 
Diagonal); the recovery process is performed by considering the edge pattern of the existing image adjacent to the 
damaged data (pixel). The system also uses the method of determining the weighted sum [1] of selected point 
functions. Many more techniques followed like enhancement performed using: Spatial and Time domain [1], 
Frequency Domain Techniques [1], Brightness Preserving Bi-Histogram Equalization (BBHE) [2]. 

Keywords: Image Enhancement, Data Loss, Recovery process 
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Abstract - In this paper, a new simple encryption technique is proposed for gray scale image encryption. The current 
technique, Cascaded Combined Permutation (CCP), is a simple technique based on the primary well known 2-D 
permutation algorithms. The application at the permutations is performed on three steps: (1) one permutation algorithm 
is applied on the image; (2) the image that resulting from the first step is decomposed into four quarters. Pixels in each 
quarter image are then permuted with one of the permutation algorithms. The resulting encrypted quarters are 
combined as one image; (3) the encrypted image resulting from the second step is further encrypted by performing 
another permutation algorithm. Experimental results show efficient encryption that is simple in implementation and 
has high degree of security. It has several key points of strength such as the sequence in which the primary permutation 
algorithms are applied. 

Keywords: Permutation, Image Encryption, Image Decryption, correlation. 
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Abstract — In this paper, a Face Recognition Algorithm using Hu moment invariants (HMIs) is described for 
identifying human faces based on the facial component-features (FCFs). Algorithm is adopted by Viola Jones detector 
which is applied the concept on the AdaBoost algorithm for detecting the face from a face database having diverse 
illuminations and expressions with complex background. Then only the face region is cropped and illumination 
correction is done using histogram equalization technique. Finally, face is converted into binary image by applying 
cumulative distribution function (CDF) with adaptive thresholding. Three types of statistical pattern matching tools 
such as Standard deviation of Hu moment invariants (StdDevHMI), absolute difference of probability of white pixels 
(AbsDiffPWP) and pixel brightness values (PBVs) through L2 norms are determined using five facial components 
such as two eyes, nose, mouth and whole face for both binary and gray level images, respectively. Lastly, face 
recognition is carried out by taking these statistical pattern matching tools with logical and conditional operators along 
with appropriate threshold values. Experimental studies are performed on the BioID database and algorithm shows a 
better result as compare to the existing popular methods. 

Keywords — Cumulative distribution function, adaptive thresholding, probability of white pixels, facial component- 
features, shape matching, Hu moment invariants, pixel brightness values. 
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Abstract - In this paper, an enhanced optical flow analysis based moving vehicle detection and tracking system has 
been developed. A novel multidirectional brightness-intensity constraints (MBIGC) estimation and fusion based 
optical flow analysis (MDFOA) technique has been proposed that performs simultaneous pixel’s intensity and velocity 
estimation in a moving frame for detecting and tracking the moving vehicle. The conventional Lucas Kanade and 
Horn Schunck optical flow analysis algorithms have been enhanced by incorporating a multidirectional BIGC 
estimation, which has been further enriched with a non-linear adaptive median filter based denoising. Such novelties 
have significantly enhanced the video segmentation and detection. A vector magnitude threshold based MDOFA 
algorithm has been developed for motion vector retrieval that eventually enables swift and precise moving vehicle 
segmentation from the background frame. A heuristic filtering based blog analysis has been applied for vehicle 
tracking. The MATLAB based simulation reveals that MDFOA-HS outperforms LK in terms of execution time and 
detection accuracy. In addition, the accurate traffic density estimation affirms robustness of the proposed system to be 
used in intelligent transport system. 

Keywords: Multidirectional brightness -intensity constraint Optical flow analysis, intelligent transport system, Lucas 
Kanade, Horn Schunck. 
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Abstract - Quantum-dot Cellular Automata (QCA) is one of the most significant technology among the Nano devices 
for computing at the Nanoscale. The key logic elements in QCA are majority gate and inverter. The majority gates are 
3 -input majority gate and 5-input majority gate. In earlier designs all the digital logic circuits are implemented using 
3-input majority gate based on 2:1 multiplexer. The limitations of the 3-input majority gate are it requires the number 
of cells for constructing large architectures involves high complexity, connectivity is difficult, laborious and low 
reliability. Hence, the design of digital circuits in this paper is implemented with 5-input majority gate based 2:1 
multiplexer. The 5-input majority gate reduces cell counts, the number of clocks required and area compared to 
existing designs. The proposed designs such as XOR gate, XNOR gate, D-latch, D flip-flop, T-latch, and T flip-flop 
have significant improvements regarding the number of gates, cell count, and delay. The proposed circuits are 
simulated with QCADesigner and results were included to verify the functionality. 

Keywords: Quantum-dot Cellular Automata (QCA), Five-input Majority gate, Multiplexer, Logic gates, Sequential 
logic. 
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Abstract - Humans are unpredictable; there is no exact way or definition of emotion prediction. Detection of human 
emotion is difficult because when we want to observe people’s behavior then they behave in normal way or better 
than abnormal behavior. May be another way where people want to collaborate with others to share their emotions, 
their daily basis problems, where they feel easy to share their expression without any fear. Maximum people are not 
agreeing to share their emotion due to shame and fear. We need a platform where people can share their actual problem 
(which they are internally facing) and release their frustration. Many people want solution without sharing of their 


problems to anyone. In order to solve this problem, social media is a best way where people can share their emotional 
behavior without any fear and we can detect their emotion as silent observer through social media. In this paper we 
will analyze their posted data on social media and we have provided the suggestion to solve their problems; also we 
detected the emotion of people through social media. We collected data from social website (Twitter .etc.) where 
people have shared their thoughts or feelings. Meanwhile, we designed an algorithm which takes data from that social 
website and on the basis of that data; application provides the result as previous emotional state of a person. A 
systematic approach was used to detect the emotion of people through social media data. This is a better way where a 
person wants to collaborate with other to share his emotions, his daily basis problems and he feels easy to share his 
expression without getting panic. This Emotional based approach described things in a new way, where all predictions 
can be measured according to the subject environment and application can provide better results in decision making. 
This approach has used the data from social portals like Twitter etc. where peoples are posting their data in form of 
emotions. Prediction and recognition of emotions is a better way to analyze the emotion of people as silent observers. 

Keywords — Emotion, Silent Observer, Parts of Speech (POS), Social Media(SM), Adjective 
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Abstract - The video detection based on the image sequence of the area of interest has attracted considerable attention. 
Particles filtration is one of the most development algorithms particularly in restoration of probability density function 
of goal state. Accordingly, the main objective of present study is utilization of adaptive algorithm for detection of 
inflexible objects. The simulation method was applied and data analysis is done by MATLAB software. The results 
represent that, filtration of the suggested particle achieved better performance than filtration of the standard particle 
in terms of prediction error of status, detection of video error, and the number of significant particles. It revealed that, 
the particle filtering enhanced the number of significant particles by IGA and, forced the collection of particles to 
better expression of actual status. This could enhance the accuracy of status prediction and reduced the error. 

Keywords: adaptive algorithm, inflexible, objects detection, particle filtration 
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Abstract - The software industry can be widely seen as a key driver for business improvement. Outsourcing of software 
development tasks has become a major issue for large software enterprises. Software outsourcing has been 
progressively increasing. However significant outsourcing failure rates have also been reported. Therefore, 
outsourcing occurred by the wrong decision can cause major technological and economic setbacks. The objective of 
this research is to develop a model for outsourcing in order to improve outsourcing process and to help out the 
organizations to overcome barriers (communication, coordination & quality) that may have a negative impact on 
software outsourcing as well as to improve their success rate. Literature is consulted to highlight various issues of 
outsourcing. A case study is conducted to validate the effectiveness of our proposed model. The purposed model 
contains different practices of agile which provide an effective way to improve coordination, quality assurance and 
reduces communication gaps in outsourcing. 


Index Terms- Agile, Outsourcing. 
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Abstract- Secure Software Development is an important issues for the software industry for couple of years as security 
issues in the software development life cycle are not easy to handle. Success of a software deeply depends on the fact 
that it is not easily vulnerable to security threats and breaches. Many organizations have made security guidelines to 
cope with these challenges to bring them in an organized and secure way. Besides so much advancements in the field, 
securing the software from vulnerabilities in not achieved in all modules of software development life cycle. The 
guidelines and methods designed for the secure software development have put a lot contributions but they are so 
verbose that these measures are nearly not implementable. In this paper a model is proposed for secure software 
development life cycle in model driven architecture level (MDA-SDLC). In the proposed model, modeling methods 
and approaches are used to ensure the advances in secure model driven architecture with simplified integrity of security 
modules in security critical software’s development lifecycles. 

Keywords — Model Driven Architecture, Security, SDLC, UML, 
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Abstract - Social persuade plays vital part in the product marketing. Though, it’s seldom been regarded in traditional 
Recommender systems (RS). This paper provides new paradigm RS which can exploit data in the social networks, 
with general approval of items, user preferences, and persuade from the social friends. The probabilistic representation 
is improved to build personalized recommendations like data. In world e-marketing, new commerce representations 
are normally introduced, new tendency started to materialize. Latest trend is the social networking websites, several 
of which concerned not only huge number of visitors and users, however online advertise company to put their ads on 
sites. This paper discovers online social networking like new e-marketing trend. We first inspect online social network 
like new web-based services, also evaluate social networks by other delegate web-based service. We extort 
information from real online social network, also our investigation of this huge dataset expose that friends contain 
tendency to choose similar items and provide similar ratings. The experimental outcome on the dataset illustrates that 
proposed scheme not only progress prediction accuracy RS but gives solution cold-start and data sparsity problems 
intrinsic in the collaborative filtering. Moreover, we recommend improving system performance by concern social 
networks semantic filtering, and authenticate its improvement through class project research. In this research we reveal 
how related friends may be choose for deduction based on the semantics friend relations and finer-grained customer 
ratings. Such technologies may be organized by mainly content providers. 

Keywords: Recommender systems, collaborative filtering, social network 
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Abstract — Now day development of software is describe by immediate process. Old systems have to take on the 
recent technologies; It can be achieved by changing or finding the features, I.e, Reengineering. Our proposed paper 
clarifies about the reengineering process of software. It also explains the efficient and better process in reengineering. 
There are two type common reengineering objectives. Improved feature: the existing software system will be of 
minimum quality, because of more changing during the time course. The main objective of reengineering is to increase 


software quality and to provide present working documentation. A higher quality degree is needed to enhance 
reliability, to minimize the maintenance cost, to develop maintainability, and to make for functional improvement. 

Keyword- Software Reengineering, Reverse Engineering, Enhanced Reengineering, SVM classification, Software 
component. 
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Abstract - Cloud computing provides IT services to users worldwide, Data centers in Clouds consume large amount 
of Energy leading to highly effective costs. Therefore green energy computing is solution for decreasing operational 
costs. This survey presents efficient resource allocation and Scheduling algorithm/Techniques analyzed on different 
network parameters without compromising network performance and SLA constraints. Results are analyzed on 
different measures, providing a significant cost saving and improvement in Energy Efficiency. 

Keywords: Data Centers, Virtualization, Consolidation, Virtual Machines, SLA 


86. PaperlD 310516145: Image Share Pane Tool: Image Sending Approach to Mobile via Bluetooth Device (pp. 
687-690) 
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Abstract — Nowadays, Microsoft Word is commonly used in various areas including industries and academia. 
Microsoft word has introduced great user friendly features, for instance, Screenshot and Screen Clipping, Smart 
lookup, Tell Me and others. Among them, Layout option button has given us to set objects with line in text. 
Furthermore, Different types of panes have provided for various tasks. Microsoft Word has given us a facility to greet 
with thumbnail image of every window you have opened at the moment. Many users while working on document 
need to insert or capturing images with Screenshot and Screen Clipping, they want to share inserted images to mobile 
via Bluetooth But, Users are disappointed because there is no any tool provided to accomplish that task and user takes 
a long procedure to apply for sharing images to mobile through the Bluetooth. This paper provides an application 
which helps users to send an inserted image via Bluetooth while working on Microsoft word and they do not to switch 
any window. By adding it into existing Microsoft Word it will helpful for people living across the world. 

Keywords- Screen Clipping; Layout Option; Share Option Button; Share Image Pane; Image capture format type 
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Abstract — The “pay-as-you-go” cloud computing model is an efficient alternative to store the data at a cheaper cost. 
Ensuring data security in cloud computing platforms is critical and has become one of the most significant concerns 
in the emerging field of cloud computing. The location of the servers where the data is stored and being accessed are 
not known to the end user. There are many numbers of different security models and algorithms which are applied to 
secure the data stored in the cloud. While these techniques are very nice, we cannot really always tell that they are 


“unhackable”. Given enough time, brains and tools any technique might be breakable because the techniques are not 
fine grained. The existing algorithms have their own flaws and so in this paper we proposed a method that is been 
improved in such a way that the data stored on the cloud is secured. The proposed method initially uses a lossless 
block division which divides the data into blocks and then division is applied storing the remainder and the group to 
which it belongs to separately and later we apply predicate encryption scheme on the data to be stored (remainder 
data) in which the keys correspond to predicates and cipher texts are associated with attributes. The public key PK 
with an attribute ‘x’ is used to encrypt the text and the secret key SKf corresponding to predicate f can be used to 
decrypt a cipher text with attribute ‘x’ if and only if f(x)=l. 

Keywords: Block Division, Predicate Encryption, Predicates, Attributes, Secret Key 
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Abstract - Radio Frequency Identification RFID is one of the most important technologies used in the internet of 
things. It is increasingly used in various applications because of their high quality as well as their low costs; however 
the avoidance of collision of tags during the identification process represents a great challenge, especially when the 
number of tags is too large. In this paper we propose a new mechanism, based on Progressive Scanning Algorithm, to 
group tags in the interrogation zone of a reader. The proposed mechanism consists in the deployment of two readers 
having the same interrogation zone. Simulated results show that the proposed mechanism can appropriately achieve 
higher performance compared to other existing algorithms in terms of the number of time slots allowing identifying 
tags and effectively in terms of total time required to do this. 
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Abstract - Automatic web pages' classification is one way to deal with the increasing range of the World Wide Web. 
Considering that most of the content of web pages is text, so classification based on text is seems to be an efficient 
solution. The methods used for text classification are usually based on the key words. But if illusive keywords appear 
within the web page, then the class of the webpage will not be properly diagnosed. Therefore, rather than paying 
attention to the words, it is needed to be given to content and words meaning. In this paper, a method based on content 
semantic correlation has been proposed. A text consists of paragraphs, sentences and words. In this study at first text 
is divided into its components and stop words is removed. Then, in order to forms the basis of the words, it will be 
needed to find the root of the words. The Hypemyms Tree of words can be extracted by using FARSNET. By using 
this method not only is the meaning of the terms considered but also there is no need to clarify the words. After 
extracting the Hypernyms Tree for all keywords, text feature vector is created. Then the similarity of the text to each 
of the available categories measured. Finally, KNN classification algorithm is used to recognize the right class of the 
webpage. The results show that by using this method, classification accuracy is increased by 0.17 in compared with 
other methods. 
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Abstract - Unlike classical information retrieval systems, the systems that treat structured documents include the 
structural dimension through the document and query comparison. Thus, the relevant results are all elements that 
match the user needs rather than the entire document. In such a case, the document and query structure should be taken 
into account in the retrieval process as well as during the reformulation. Query reformulation should also include the 
structural dimension. In this paper, we propose an approach of query reformulation based on structural relevance 
feedback. We start from the original query and the fragments judged as relevant by the user. The analysis of the 
structure of document fragments and textual content of elements enables identify elements that match the user query 
and rebuild it during the relevance feedback step. The main goal of this paper is to show the impact query reformulation 
based on an analysis of the structure and content of each relevant element retrieved by an initial search process. Some 
experiments have been undertaken into a dataset provided by INEX to show the effectiveness of our proposals. 

Keywords: Information retrieval; XML document; relevance feedback; Line of descent matrix; Classification. 


91. PaperlD 310516181: An Access Fairness Resource Provisioning of Services for Energy Efficiency in 
Wireless Cellular Ad-hoc Network (pp. 735-747) 

Sridhara S. B., Assistant professor Department of Electronic and Communication, Rajiv Gandhi Institute of 
Technology, VTU, Bangalore, India 

Dr. Ramesh B., Professor Department of Computer Science and Engineering, Malnad College of Engineering, VTU, 
Hassan, India 

Abstract - The recent growth and development of smart phone technology have resulted in the growth of production 
of low cost smart phone devices. Due to the availability of low costs smart devices have resulted in increasing in the 
number of application and its user. The users in cellular network are mobile in nature and varied application services 
is been used such as FTP (File Transfer Protocol), VoIP (Voice over Internet Protocol), Multimedia services 
etc... which requires different data rate for each services. To assure a QoS (Quality of Services) for this kind of user 
application dynamic requirement and is a challenge that exists in existing wireless cellular adhoc network that need 
to be addressed. To achieve an efficient QoS & D2D (Device to Device) architecture is required. Many existing work 
based on D2D on cellular network have been proposed in recent times but they are not efficient in term of access 
fairness for varied traffic classes and it induces high cost of deployment since it require new infrastructure. To 
overcome this here the author adopts a cost effective D2D multicast communication based on pre-processed cellular 
infrastructure graph and admission control strategy for selectivity of services of varied traffic size in order achieve an 
efficient access fairness that reduces the packet drop rate and improves the overall packet delivery ratio of the network. 
The simulation outcomes show that the proposed model reduces the packet drop rate and improves the packet delivery 
ratio of the cellular ad-hoc network. 

Keyword: Admission control, cellular network, graph pre-processing, d2d, routing. 
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Abstract - Brainstorming is a technique for generating a large number of ideas for creative problem solving. The 
generation of new ideas, especially high quality creative ideas is important for a problem. It is a popular method of 
group interaction in both educational and business sectors. Brainstorming engenders synergy i.e., an idea from one 
participant can trigger a new idea in another participant. Brainstorming must been recognized as an effective group 
decision supporting approach. This paper discusses about some of the variations of Brainstorming techniques and 


previous approaches carried out to improve the quantity and quality of ideas, significance of creative thinking, target 
to increase productivity, requirement of group brainstorming and effectiveness of E-Brainstorming. 

Keywords: Brainstorming, Decision Support System, Creativity, Management Information System. 
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Abstract - Diabetes Mellitus is a chronic metabolic disorder. Normally, with a proper adjusting of blood glucose levels 
(BGLs), diabetic patients could live a normal life without the risk of having serious complications that normally 
developed in the long run. However, blood glucose levels of most diabetic patients are not well controlled for many 
reasons. Although the traditional prevention techniques such as eating healthy food and conducting physical exercise 
are important for the diabetic patients to control their BGLs, however taking the proper amount of insulin dosage has 
the crucial rule in the treatment process. In this paper we have proposed a model based on artificial neural network 
(ANN) to predict the proper amount of insulin needed for the diabetic patient. The proposed model was trained and 
tested using several patients’ data containing many factors such as weight, fast blood sugar and gender. The proposed 
model showed good results in predicting the appropriate amount of insulin dosage. 

Keywords: Diabetes, Artificial Neural Network (ANN), Blood Glucose Levels (BGLs) 
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Abstract - Process Management is one of the primary tasks achieved by the Operating Systems. The system’s 
performance sententiously depends upon CPU scheduling algorithms. Round Robin, contemplated as the most 
extensively endorsed CPU scheduling algorithm, is an optimal solution for the timeshared systems. In timeshared 
systems, selection of the time quantum plays a pivotal role in performance of CPU. In Round Robin, the static nature 
of the time quantum emerges some problems directly related to the quantum size which decreases the performance of 
CPU. In this paper, selection of time quantum is reviewed and a new algorithm for CPU scheduling, Optimum 
Dynamic Time Slicing Using Round Robin (ODTSRR) is proposed for timeshared systems. The proposed algorithm 
is based upon dynamic time quantum. Round Robin algorithm is redressed in this paper, ODTSRR also contains the 
advantages of RR (Round Robin) CPU scheduling algorithm have less chances of starvation. Performance of proposed 
algorithm is compared with RR and other shades of RR and the results revealed that the proposed algorithm is better 
in response time & waiting time, context switch rates, turnaround time and throughput hence resulting in optimized 
CPU performance. 

Keywords: Operating System, Scheduling, Round Robin CPU scheduling algorithm, Time Quantum, Context 
switching, Response time,, Turnaround time, Waiting time, fairness. 
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Abstract - Recommendation has been a major area that any recruiter would look for on a given job description. Increase 
in digital communication has made things easy to upload resumes and make it available for recruiters; on the other 
hand increase in technologies would make any recruiter difficult to scan it manually. Here we introduce an application 
which processes text data, understands sentence behavior unlike conventional keyword search applications and gives 
out required resume as per job description provided to application. This application makes use of Natural Language 
Processing (NLP) which helps in data training and feature extraction of the text data. Using NLP methods, semi 
structured text data is converted to structured format with required extracted features. To make this application scalable 
to any size of data we propose this implementation on Hadoop framework, which can handle any number of resumes 
or even more than petabytes of data, termed as bigdata. 

Keywords: BigData, Attribute Tagger, NLP Methods, Named Entity Recognition (NER), Map-Reduce, Hadoop, 
HBase, Hive 
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Abstract - With the immense increase in the processing power over the past few decades, battery life has proved to be 
a crucial resource. Since energy varies quadratically with voltage in the CMOS based processors, Dynamic Voltage 
Scaling (DVS) offers a solution to conserve the battery power by lowering the supply voltage. However, reducing the 
voltage increases the execution time and therefore, real time scheduling has to be combined with DVS so as to provide 
the deadline guarantee. This paper presents an algorithm, Recurring Variable Voltage Scheduling(RVVS) to extend 
the battery life using a combination of variable voltage and a real time scheduling algorithm (Earliest Deadline First). 
The paper also mathematically proves that if two voltage levels are used such that one is twice the other, up to 50% 
energy can be saved. Mathematical proof of delay increment due to voltage reduction has also been presented. RVVS 
has been optimized in order to reduce the overall energy dissipated by switching by introducing a factor ‘n’ that 
denotes the number of time units after which the voltage switch can occur. RVVS has been applied to task sets having 
different number of tasks providing an average energy saving of 27%. This significant amount of energy saving helps 
extending the battery life to a remarkable extent and proves the worth of RVVS in the field of real time DVS. 

Keywords: Dynamic Voltage Scaling; Earliest Deadline First; Real time scheduling; Voltage switching; Energy 
efficiency; Variable voltage 


97. PaperlD 310516202: Design and Detection of Network Covert Channel - An Overview (pp. 821-828) 

R. Rajamenakshi, Department of Computer Science, Avinashilingam Deemed University, Coimbatore, India 
Dr. G. Padmavathi, Department of Computer Science, Avinashilingam Deemed University, Coimbatore, India 

Abstract - Sensitive information leakage is increasing due to wide spread use of internet and technology. The attackers 
find new ways to exfiltrate data that pose threat to data security and privacy. Here our focus is on the covert information 
leakage over the network that exploits the various network protocols and their behavior. Information leak over covert 
channels exploit a variety of protocols of network protocols including Wireless, mobile and virtualized cloud platforms 
etc. Current network security solutions like IDS, IPS, firewalls etc. are not designed to handle these type of attacks. 
These type of attacks are dynamic in nature and mimics the legitimate traffic behavior, there by posing a challenge to 
detect and prevent. This article presents comprehensive review of the network covert channel, design, detection and 
mitigation. We have reviewed the classification of covert channels based on the attacks. 
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Abstract — In this paper we introduce and study a new sort of intuitionistic fuzzy interior -hyperideals of a - 
semihypergroup, called ( , )-intuitionistic fuzzy interior -hyperideals by using the combined notions of 
belongingness and quasicoincidence of intuitionistic fuzzy points and intuitionistic fuzzy sets and some interesting 
properties are investigated. We show that an IFS A = ( A, A) is an ( ^ , ^ V q)-intuitionistic fuzzy interior - 
hyperideal ofH if and only ifU(t, s) ={x £ H: x(t, s) £ A} for all t £ (0,0.5] and s £ [0.5, 1) is interior r -hyperideal 
of H. Moreover, we show that an IFS A = ( A, A) is an ( ^ ^ V q)- intuitionistic fuzzy interior -hyperideal of 
H if and only if [A](t, s) ={x £ H: x(t, s) £ VqA}for all t £ (0, 1] and s £ [0, 1) is an interior -hyperideal of H. 
These showed that ( £ , £ V q)-intuitionistic fuzzy interior -hyperideals of H are generalization of existence of 
intuitionistic fuzzy interior r-hyperideal of FI. 

Keywords: Semigroup, Intuitionistic fuzzy point; Intuitionistic fuzzy sets; ( , ) -Intuitionistic fuzzy interior ideal. 
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Abstract: There are many aggregation operators and its applications have been developed up to date, but in this paper, 
we develop the Pythagorean fuzzy hybrid geometric (PFHG) operator, and also study some properties, such as 
monotonicity, idempotency, and boundedness of the proposed operator. Pythagorean fuzzy hybrid geometric operator 
is the generalization of the Pythagorean fuzzy weighted geometric (PFWG) operator and the Pythagorean fuzzy 
ordered weighted geometric (PFOWG) operator. Finally, we apply the Pythagorean fuzzy hybrid geometric (PFHG) 
operator to deal with multiple attribute decision making (MADM) problems under Pythagorean fuzzy information. 
Using Pythagorean fuzzy hybrid geometric aggregation operator, we also develop an algorithm for multiple attribute 
decision making (MADM) problems. Lastly we construct an example for multiple attribute decision making 
MADM problems. 

Key words: Pythagorean fuzzy sets, Pythagorean fuzzy hybrid geometric PFHG operator. Decision making 
problems. 
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Abstract - Application of new technologies is considered as a key factor for the development of companies in recent 
years. This puts emphasis on the importance of reviewing factors influencing the acceptance of information technology 
culture. This study has been done aiming to identify factors influencing the information technology acceptance in 
companies located in the Tehran science and technology park. 80 companies from industries based in science and 
technology parks in Tehran were selected of these, 72 questionnaires have been evaluated and Cronbach's alpha was 
used to measure the reliability and validity of measurement tools. The reliability coefficient of the questionnaire is 
0.86, which indicates high reliability of the applied questionnaire and content validity was confirmed by instructors. 
The research data is analyzed by SPSS which uses the correlation analysis along with significance levels and in the 
following, t and f tests have been used to study the research additional hypotheses. The results of this study showed 
that the usefulness and ease of use and subjective norms affect the information technology acceptance through 


behavior intent and using independent ttest, it was found that looking at research indicators is alike among men and 
women. Based on the f statistics, attitude to these indices among different education levels is different and the 
respondents’ education has an impact on attitudes to these indicators. 

Keywords: cultural factors, Information Technology, technology acceptance, TAM, UTA 
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Abstract — Signcrypion is a cryptographic method in which 
signature and encryption apply on message in a single step. On 
other hand image steganography is a strongest technique for 
hiding data or information. Therefore Communication through 
insecure channel is challengeable task for an organization. 
Recently two tier security gain popularity because most of the 
business organizations wants maximum security of 
data/information. In this paper we design a new scheme using 
cryptographic and stenographic techniques at once on the basis 
of image steganography and elliptic curve cryptography. In 
proposed design scheme we use both of the steganography as 
well as cryptography. The cryptographic technique encrypts 
the data by using Elliptic curve cryptography in such a manner 
that third party not understands the original message contents. 
Stenographic technique is used to hide the text in image and 
then we take hash as well as signature. It also assures the 
security properties like message confidentiality, message 
integrity, message non repudiation and also message 
authentication. 

Keywords-component Cryptography, Steganography, 
Signcrypion, Elliptic curve cryptography. (Key words) 


I. Introduction 

Communicating data through noisy channel needs Secrecy. 
Most of the organization needs maximum security to protect 
their data from third party. Thus we collectively used 
steganographic as well as cryptographic techniques for better 
security. Cryptography is a technique in which plain text 
(message) is converting into cipher text using key and 
encryption algorithm. While steganography used for hiding of 
data. Steganography is derived from two Greek word 
“Stegano” mean hide and “graphics” means writing [1]. So it 
is the art of science in which we study how to hide secure data. 
The major aims of steganography is that the text of secret 
message is hidden or conceal while in that of cryptography the 
text of secret message is readable but not understandable to 
third party. Sometime needs the extra security properties like 
authentication and integrity. Thus integrity ensures by one 
way hash function and authentication provided by digital 
signature algorithm. Public key cryptography was introduced 
by deffi and helmen [2] .This technique uses two keys(public 
and private) at both side (sender and receiver) .It was build for 
those communicating parties in which they cannot meet before 
to start communication. Thus signature generation and 
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encryption is so costly, zheng give solution to this problem by 
contributing signcrypion [3]. It combines both the 
functionality of digital signature and encryption in the single 
step. But this is not enough for maximum security. We 
proposed two tier schemes on the bases of signcrypion using 
elliptic curve and image steganography for hiding the cipher 
text. It also assures the security properties like message 
Confidentiality, message integrity, message non repudiation 
and also message authentication. 

II. PRELIMINARIES 
Some basic preliminary are define bellow. 

• Elliptic curve 

Firstly ECC can be divided into curves one is prime curve 
(TP) and another is binary curve (ZF (2m)). Prime curve (TP) 
is to use in software applications, because it not required 
extended bitfidding operation, which (ZF (2m)) require. But 
hardware application are required a few logic gates to build a 
dominant crypto-system used a binary curves (ZF (2m)). 
Secondly, ECC variable and coefficients are limited in a finite 
field of elliptic curve, this limitation would to increase the 
efficiency of ECC computational operations. 

In finite field T p mod p an elliptic curve is represent 
as Ep ( a , b) \ y2 = x3 + ax ( mod p) 
where (a, b ) € Tp and 4a3 + 27b2 mod p ^ 0. The 
condition 4a3 + 27 b2 mod P is essential to certify that 
y2 = x3 + ax + b (mod p) has no repeated factors in 
finite Abelian group in the set of Ep ( a , b ). 

The point 0 is a zero point of an elliptic curve. This point at 
infinity is third point of intersection of any straight line with 
the curve. Points (x, y), (x. — y) and 0 is on any straight line 
with the curve. Now addition rules, which is denoted by “+” 
are given below: 

1) 0 + p = p and p + 0 = P , where 0 is additive 
identity. 

2 ) —0 = 0 


3) +(— p) = (— p) + p = 0 , where -p is negative points of 

p. 

4 ) (p + Q) + R = v + (Q + R) 

5) p + Q= Q + p 

For any two points p = xp,yp and Q= xq,yq over Ep(a,b) 

the elliptic curve addition operation, which is denoted as P + 

Q = R = (, xr , yr ), satisfies following equations. 

Xr = (A2 — xp — xq)modP ; Yr 

= (A (xp, xr) — yp)mod p 

Where 

A = (yq — yp/xq — xp ) mod p,if P ^ Q 
Where A = (3x2p + a/2yp) mod q,if P = Q 

• Plaintext. Message before encryption is known as 
Plaintext. It is the original message that sender want 
to send the receiver. Plaintext is understandable as 
well as readable. 

• Cipher text. Message obtained after encryption is 
known as Cipher text. The cipher text is readable but 
not understandable. 

• Cryptology. The study of both cryptography and 
cryptanalysis is known as Cryptology. In cryptology 
both the encryption and decryption phases occurred. 

• Cryptanalysis. The breakdown of Cipher text to 
plaintext without knowing key is known as 
Cryptanalysis. In Cryptanalysis the not 
understandable text is transform to understandable 
text (from cipher text to Plaintext). 

• Cryptosystems. Message encrypted by Computer 
system for the purpose of secure data transmission 
and storage known as Cryptosystem. 

• Key. It is variable values that are used in both in 
encryption as well as decryption stage. 

• Cover object. Original object in which we embed 
secret data or message is known as cover object. 

• Stego object. After embedding data in cover object 
stego object is formed. 
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• Steganalysis. The breakdown of stego object into 
cover object is known as Steganalysis. 

• Hash functions. It is a function which is used for 
digest of a message. Hash is a one way function. 

III. RELATED WORKS 

For confidentiality and integrity of data two types of 
techniques are used for data hiding which are steganography 
and cryptography [4]. In steganography information are being 
transmitted through a channel through which other kind of 
information’s already transmitted [5]. The aim of 
steganography is to hide information inside other 
“undamaging” digital media in such a way that third party not 
detects the existence of secret information [6]. The major aim 
of steganography is to communicate two different parties in 
such a way that the third party is unaware from transmission 
of hidden data [7]. 

In cryptography the text of secret message is visible to third 
party. The text is readable but not understandable to third 
party [6] . In cryptography, the format of data is in such a way 
that the text is worthless and unintelligible way [8]. The 
cryptography prevents the hacker from reading the message 
which has been sends by or to sender [9]. 

For better security we have to combine both of Steganography 
and Cryptography. The Cryptography encrypts the message 
while the Steganography hide the secret message which 
provides more security to secret information [4, 10]. Without 
viewing the stego object is being transmitted from sender to 
receiver and vice versa. Whenever, if an attacker detects the 
message from the stego object, so first he had to decode the 
message from digital media. Then he had to require the 
cryptographic algorithm to decipher the encrypted message 
[ 11 ]. 

The author suggests the designing of strong and secured image 
steganography based on least significant bit (LSB) insertion 
technique [12]. 


In [13] the authors suggest a technique that provides better 
security by using two tier techniques (cryptographic and 
steganography) at the same time. 


IV. PROPOSED SCHEME 

This section includes details discussion about the proposed 
two tier scheme. It includes three phases namely called key 
generation phase, Signcrypion and un-Signcrypion. 

Key generation phase 

In this phase the sender and receiver select their private keys 
and also generates there public keys. 

• Sender Pick5 a from {1, 2, 3, ..., t — 1} as a private 
key & Calculate their public key 5), = S a Sa. G. 

• Receiver Pick P v from {1, 2, 3, ... ., t — 1} as a 
private key & Calculate their public key d v = P V .G. 

Signcrypion Phase 

In This section the sender first generate the signcryptext of a 
message M after that then generate the stego image of the 
signcryptext. Now stego image send to the receiver. 

Algorithm 

(1) Generate a random no. x € {1,2,3, , t — 1} 

(2) Calculate K — x.P v mod t. 

(3) split K = K1,K2 

(4) C - Kl(m) 

(5) d = hide(C) 

(6) r = /i(d) 

(7) R — r.G 

(8) S = x/r + S a 

Send to receiver (5, r, R, d ) 
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Block Diagram 

The block diagram of a Signcryption phase is followed below. 


Figure 1: Block diagram of encryption phase. 

Explanations 

The Signcryption phase consists of following steps. First the 
secret message is encrypted through Elliptic curve 
cryptography into cipher text. Than we hide the cipher text 
through a cover image through steganographic technique. 
Than we take signature and hash of stego file mean combine 
hash of cipher text as well as cover image. We get a stego 
image (d), signature (S) and hash (h) value. Than the sender 
send stego image (d), hash (h) and signature (S) to receiver 
side. 

Un-Signcrypion Phase 

The un-Signcryption phase consists of following steps 
which is explained in following algorithm. 

Algorithm 

(1) K1,K2 = S.d v (S b +R) 


Vol 14, No. 6, June 2016 

(2) Recover d 

(3) Decrypt M = DK1(C ) 

(4) Check r.G — R than Accept otherwise reject. 

A. Block Diagram 

The block diagram of the un-Signcryption phase is following. 


Figure 2: Block diagram of Decryption phase. 

Explanations 

The receiver receives the stego image (d), Hash value (h) and 
signature (S). Than we apply steganographic technique we get 
the RGB pixel value of stego image and hence we get a cover 
image and cipher text. Than we takes the elliptic curve 
cryptographic decryption and hence we get plaintext. 

The receiver receives the stego image (d), Hash value (h) and 
signature (S). Then the receiver gets the LSB of stego image 
and has to decrypt it through elliptic curve cryptography. By 
decryption we get cipher text. Than we apply steganographic 
method to get cover image and plain text. At last we take 
Hash function of plaintext and cover image we get Hash value 
new. If the Hash values old are equal to Hash value new than 
it shows integrity. If the Hash value old and Hash value new 
became different than it not provide integrity. 
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VI. CONCLUSIONS 


V. SECURITY ANALYSIS 
The security of our proposed design scheme is based on 
ECDLP. In security analysis we focus on main security 
features like integrity, Confidentiality, authentication and non- 
repudiation. 

Confidentiality 

The proposed scheme provides the Confidentiality property. If 
the attacker wants to decrypt the message for that the attacker 
needs to get x as well as K1 which is the public key of 
receiver. Finding two unknown parameter from one equation 
is difficult as well as impossible for attacker. This problem is 
known as discreet logarithm problem due to which the attacker 
cannot revels the original contents of message due to which 
our scheme provide confidentiality property. 

Integrity 

Our scheme provides the property of integrity. As in proposed 
scheme we use hash function which has one way property. 
Hash provides the irreversibility property due to which our 
scheme is more secure than existing schemes. The hash 
function use in our scheme provides the property of integrity. 

Authentication 

Our proposed scheme provides the property of authentication. 
When the attacker generate a forge sign, for this it required the 
randomly generate number x and private key of sender s a from 
the mentioned equation. Now calculating two unknown 
variables from same equation is hard for attacker. 

Non repudiation 

Our scheme provides the property of non-repudiation. If the 
sender deny from message which has been send by sender. 
The third party proves that the message is being sent by sender 
or not from private key of sender. This property is known as 
non-repudiation. 


A new and advance two tier secure steganography provide 
better security to our data. Cryptography and steganography 
are the techniques used for security of data. Cryptography is 
technique of secret writing while the steganography hide the 
secret messages. In this paper we are going to discuss the two 
tire concept in which both the cryptographic and stenographic 
techniques are used for security of data. The proposed two tier 
scheme is more efficient on the basis of security. Our scheme 
is more secure as compare to the other existing schemes which 
are discussed in the above literature. The proposed scheme 
provides the properties of Confidentiality, integrity, non- 
repudiation and authentication. 

REFERENCES 

[1] Pfitzmann, B., Information hiding terminology - 
results of an informal plenary meeting and additional 
proposals. In: Proceedings of the First International 
Workshop on Information Hiding. Springer- Verlag, 
London, UK, pp. 347-350. ( 1996 ). 

[2] Diffie, W., & Heilman, M. E. (1976).New directions 
in cryptography. Information Theory, IEEE 
Transactions on, 22(6), 644-654. 

[3] Yuliang Zheng, “Signcryption and Its Applications in 
Efficient Public Key Solutions ,” An invited lecture at 
the 1997 Information Security Workshop (ISW'97), 
LNCS, Vol. 1397, pp.291-312, Springer- Verlag, 
1998. 

[4] A. J. Raphael and V. Sundaram, “Cryptography and 
Steganography - A Survey”, Int. J. Comp. Tech. 
Appl., Vol 2 (3), pp. 626-630 , ISSN:2229-6093. 

[5] B. Li et al , “A Survey on Image Steganography and 
Steganalysis”, Journal 

of Information Hiding and Multimedia Signal 
Processing, Vol. 2, No. 2, pp. 142-172, April, 2011, 
ISSN 2073-4212. 

S. A. Laskar and K. Hemachandran, “An Analysis of 
Steganography and Steganalysis Techniques”, Assam 


239 https://sites.google.com/site/ijcsis/ 

ISSN 1947-5500 



(IJCSIS) International Journal of Computer Science and Information Security, 

Vol. 14, No. 6, June 2016 


University Journal of Science and Technology, Vol.9, 
No.II, pp. 83-103, January, 2012, ISSN: 0975-2773. 

[7] N.F. Johnson and S. Jajodia, “Exploring 

Steganography: Seeing the Unseen”, IEEE, 

Computer, vol. 

31, no. 2, pp. 26-34, Feb. 1998. 

[8] G. J. Simmons, "Subliminal Channels: Past and 

Present," European Transactions on 

Telecommunications, Vol. 4, No. 4, pp. 459-473, 
Aug 1994. 

[9] R. Anderson, “Cryptanalytic Properties of Short 
Substitution Ciphers”, Taylor & Francis, Cryptologia, 
Vol. XIII, No. 1, pp. 61-72, January, 1989. 

[10] S. Song et al, “A Novel Secure Communication 

Protocol Combining Steganography and 

Cryptography”, Elsevier Inc, Advanced in Control 


Engineering and Information Science, Vol. 15, pp. 
2767-2772, 2011. 

[11] M. A. Fadhil, “A Novel Steganography- 
Cryptography System”, Proceedings of the World 
Congress on Engineering and Computer Science 
2010, USA, Vol. I, October, 2010, ISSN: 2078-0966. 

[12] Jinsuk et al , “(N,l) Secret Sharing Approach Based 
on Steganography with Gray Digital Images”, 
Wireless Communications, Networking and 
Information Security (WCNIS), IEEE International 
Conference, 2010, pp-325 - 329 

[13] Mahajan et al , "Enhance Two-Tier Secure Model of 
Modern Image Steganography." (2014). 


240 


https://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



International Journal of Computer Science and Information Security (IJCSIS), 
Vol. 14, No. 6, June 2016 


Formal Model of Smart Traffic Monitoring and 

Guidance System 

Umber Noureen Abbas, Farhan Ullah, Nazir Ahmad Zafar 

Department of Computer Science, COMSATS Institute of Information Technology Sahiwal 
COMSATS Road off GT road, Sahiwal 57000, Pakistan 


Abstract — Emergency Services Rescue 1122 and Smart Sticker 
components of our proposed Smart traffic monitoring and 
guidance system model are presented in this paper to provide 
smart emergency services and to identify vehicles to develop 
advanced transportation system. It involves the Wireless Sensors 
and actors to communicate with the system. The proposed 
components require fewer resources in terms of sensors and 
actors. Further, Sensors component identifies vehicles through 
Smart Stickers and it is readable through sensors from its 
barcode and barcode consists of vehicles details in terms of 
vehicles registration, model, engine and color. Secondly, 
Emergency Services Rescue 1122 component provides emergency 
services as it locates the vehicles through sensors and informs the 
local authority for providing emergency services. Third, violation 
of rules detects intruders on roads to provide smooth flow of 
traffic. Fourth, to avoid congestion, traffic signals are configured 
and communicated with sensors to update the system if 
congestion occurs. The proposed components of our model are 
implemented by developing formal specification using VDM-SL. 
VDM-SL is a formal specification language used for analysis of 
complex systems. The developed specification is validated, 
verified and analyzed using VDM-SL Toolbox. 

I. Introduction 

Advanced transportation system has the ability to handle 
complex situations in developed countries. A good 
transportation system is very important for the development of 
a country. It does ensure reliability and safety operations on 
roads. The main objectives of traffic management system are 
the ability to reduce congestion problems and overcome 
various diseases caused by traffic pollution. With the growth 
of population, traffic management is a big problem in Pakistan. 
Irregular traffic is a big problem in populated cities of 
Pakistan like Karachi, Lahore, Rawalpindi, Islamabad, 
Hyderabad, and Peshawar. Intelligent traffic management 
system is useful to overcome traffic problems and it can 
contribute a lot to the development of a smart city. Thousands 
of vehicles are passing through a city and due to poor traffic 
management system; it causes traffic congestion. 

Traffic congestion can cause, accidents, waste of time, 
reduces trade opportunity, increases energy consumption and 
negatively affect the education system. It has a big impact on 
society and thus create problems for everyone [1]. 

These issues will be solved if we develop smart traffic 
management system in which sensors are used to gather real- 
time data about traffic at specific points and then it will 
communicate with other sensors to replicate information to 
users. In this system, traffic lights and LED screens will be 


used as sensors to guide the users about the traffic situations. 
Smart sensors in traffic signals sense traffic situations to 
develop an intelligent transportation system [2, 3]. Smart 
Traffic Monitoring and Guidance System (STMGS) have a 
basic purpose of maintaining a balance among different types 
of vehicles. It provides security, road safety, less energy 
consumption, smooth traffic flow, and better guidance to users. 
In smart traffic management system sensors will gather real- 
time data from traffic flow and after analysis, this information 
will be uploaded on the cloud. Traffic management 
department will view the information from the cloud and will 
be able to take necessary decisions to make it better. 

We proposed a model for better traffic monitoring and 
management system to solve congestion problems. The model 
is called Smart Traffic Monitoring and Guidance System. 
Formal methods are effective ways to formalize the valid and 
invalid data [4]. Semi-formal and formal methods are two 
most important methods to write requirements. We transferred 
requirements from semi-formal way to formal way. Different 
types of formal methods like Z notation, B method, and 
VDM++ [5] are used for formal modelling. 

Currently we focused on emergency services rescue 1122, 
smart sticker, and violation of rules and traffic signals 
components in our model. We did formal modelling in Vienna 
Development Method Specification Language (VDM-SL) of 
emergency services rescue 1122, smart sticker, and violation 
of rules and traffic signals components in our model. This 
paper consists of seven sections which are Literature Review, 
Problem statement, Model of smart traffic monitoring and 
guidance system, UML use cases, Formal specification, 
Formal analysis with results and conclusion. 

II. Literature Review 

Traffic jam is a big problem in any metropolitan city. 
Numbers of vehicles are increasing day by day due to rise of 
standard living. Due to this problem there is a need of 
advanced research to develop an intelligent traffic system 
(ITS) which can work automatically to solve a problem or to 
investigate a problem in traffic. A model is proposed in [6] 
which uses centrally placed microcontroller and infrared 
proximity sensors which uses vehicular length to develop an 
intelligent traffic monitoring system. This system needs real 
time data to be collected for each situation, in a dynamic 
situation to gather data about traffic is a time consuming and 
expensive work. It uses UML (Unified modeling language) 
diagram to interact between user and system. In addition we 
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have used VDM-SL (Vienna Development Method- 
Specification language) to formalize and verify sensors data. 
A model is proposed in [7] which works on simple algorithm 
based on length of traffic lane. Time allotter on current lane 
affects the length of traffic on the other lanes. To determine 
the length of traffic proximity sensors are used instead of 
WAN. Intelligent traffic system (ITS) implemented through 
wireless sensor networks, Radio Frequency Identifier (RFID) 
and graph theory concepts to find the shortest path. This 
algorithm does not able to identify a vehicle or to update the 
driver about traffic situation at specific intersections. We have 
used smart sticker for each vehicle to identify vehicle and to 
update the system about the vehicle record. Second we have 
used Light emitting diode (LED’s) display to show traffic 
information to drivers at specific points. A new traffic signal 
control algorithm is proposed in [4, 8] to develop intelligent 
traffic system. The new traffic signal can measure the 
individual travel time of a vehicle. The control algorithm 
calculates the delay and then searches the optimal solution to 
minimize the total delay based on queuing model. We have 
used sensors to control the congestion at threshold value, to 
find the location of vehicles through Global Positioning 
system (GPS) and update rescue team about accidental 
situations. In [9, 10], authors proposed a system which used 
image processing technique to analyze the traffic flow to 
control the traffic. They used image sensors embedded with 
web based cameras to capture the movements of vehicles. 
This system cannot give correct result as image processing is 
very complex to analyze each situation of the traffic. Second 
this technique requires expensive hardware for image 
detection and processing. Unlike this, we have used smart 
sensors which just collect data about the traffic to update 
system. We proposed a model for smart traffic and guidance 
system in which we used sensors as a backbone. Sensors are 
configured at different places in a city. All sensors collect data 
from dynamic flow of traffic and update system and then 
system inform local authorities about the situation. We used 
VDM-SL for requirements specification to formalize and 
verified our requirements used in our model [5]. In this paper, 
author is presented moving block interlocking with its safety 
characteristics for avoiding collision and derailing of trains at 
the crucial areas of the network [26] . In this study, author has 
described the entire state space of critical components of 
moving block interlocking system by using graphs and in 
combination with z notation and also provided the 
specification of the work [27]. This paper presents a novel 
approach to model and analyse an important and critical 
component of moving block railway interlocking system that 
is railway crossing [28]. Author has explained formal methods 
as an advance software engineering technique and used Z 
notation for describing formal specification of critical m X 
modules of automated train control system [29]. Railway 
interlocking system is very critical in different ways like 
safety, environmentally, economically and its nature of 
distributed system, any failure may risk the human life, 
environment and money. The author has used formal 
specification languages such as VDM, Z-notation for its 


modelling using crisp (two-valued logic) theory [30]. As the 
traffic system also include all these risks and we have 
overcome these problems by our proposed system. 

Some of the other related work on application of formal 
method in critical systems can be found in [18-25]. 

III. Problem Statement 

In these days, the major focus of the industry is on smart 
systems. Smart systems are IOT based systems in which each 
component acts as a communicable device [2] alarm systems 
[11] and fire detection [3]. Developed countries are 
transferring their system on IOT. In Pakistan, there is no as 
such intelligent IOT system for transportation. As 
transportation is a major factor for health, education and other 
aspects of life. However, problem is that building such 
systems is very difficult because of their complexity [6]. 
Therefore, this field requires a lot of research and struggle to 
develop such a traffic management system in which all these 
issues can be solved. Such system requires real-time data 
analysis from traffic flow to find better solutions for every 
problem [12]. 

The main problem of building such system is requirement 
engineering. These requirements are not understandable to 
develop, due to this, we are building a formal model of 
requirements for smart traffic management system [13]. 

IV. The Model of Smart Traffic Monitoring and 
Guidance System 

Traffic monitoring and guidance system is an application 
of advanced technology to solve the traffic problems and to 
support smooth flow of traffic. Unbalanced signals and high 
load of traffic increases the problems in traffic flow. 

In our model we have described the main problems of 
traffic system which are traffic congestion, emergency 
incidents, vehicle identification through smart sticker, and 
smooth control of traffic signals, security and users guidance 
about traffic through Light Emitting Diode (LED) about at 
different points. Currently we have focused on smart sticker, 
emergency services rescue 1122 components in our model. 
Smart sticker has a barcode which consists of owner name, 
vehicle registration number, vehicle model number, vehicle 
engine number and color of vehicle. Smart sticker is pasted on 
front screen of vehicle. Sensors installed at different points 
will read the smart sticker from vehicles, extract information 
from it and then update the system about the vehicle. 
Emergency services rescue 1122 will be activated if any 
emergency occurs. Sensors will automatically detect 
miscellaneous activities. The traffic system automatically 
update through new information. The admin will be quickly 
informing the rescue 1122. These rescue services provide the 
medical treatment as soon as possible. 

Traffic signals are installed and configured to 
communicate with the sensors. Traffic signals gather real time 
data from traffic flow and report congestion to system if it 
reaches to a threshold value. Then Traffic light will turns to 
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green automatically. Then system will display updated 
information on LED’s (light emitting diode) to inform the 
users. Traffic signals will turns to green immediately if it 
detects any emergency vehicle which provide social and 
health services to people. 

Users must follow the traffic rules to overcome accidents 
on roads and to provide smooth flow of traffic. Traffic rules 
are not only necessary for vehicles but at the same time these 
are necessary for pedestrian. Sensors will operate on traffic 
rules and report the system about vehicles not following the 
traffic rules. There are many rules which are important for 
traffic flow but after the Second World War there are number 
of rules implemented to ensure the road safety [16]. 

Figure 1: shows a high level model of traffic system. We 
can say, it also represent the work we have done with VDM 
SL to formalize traffic system. This model is not fully 
functional traffic system but we tried to cover some important 
aspects of traffic system and because it is a critical system so 
we formalized all our work to show it is valid. 



Figure 1 : Smart traffic monitoring and guidance system Model 

Users must follow the traffic rules to overcome accidents 
on roads and to provide smooth flow of traffic. Traffic rules 
are not only necessary for vehicles but at the same time these 
are necessary for pedestrian. Sensors will operate on traffic 
rules and report the system about vehicles not following the 
traffic rules. There are many rules which are important for 
traffic flow but after the Second World War there are number 
of rules implemented to ensure the road safety [16]. 

Figure 1: shows a high level model of traffic system. We 
can say, it also represent the work we have done with VDM 
SL to formalize traffic system. This model is not fully 
functional traffic system but we tried to cover some important 
aspects of traffic system and because it is a critical system so 
we formalized all our work to show it is valid. 

V. UML Use cases 

A use case diagram displays the relationship among actors 
and use cases. Use case diagrams determined the relation 


between actor and performance action. Interaction between 
rescue team and system described through use case diagram. 
In our model we will focus on rescue 1122, smart sticker, 
Traffic signals and Violation of rules. In rescue 1122 use case 
has an actor and performs some action [17]. Traffic signal use 
case shows interaction between vehicle owner and traffic 
signals. 
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VI. Formal Specification of Modules 
A. Rescue module 

The functionality and specification of Emergency is 
formally modelled by using VDM-SL. Its properties are 
specified with the help of known modelling and specification 
language, VDM-SL. Firstly we describe the emergency 
service and then we will look closely into different functions. 
This service is designed for attending medical emergency 
situations on road, caused by traffic collision. Before taking 
patient to hospital it is necessary to give him first aid if he is 
serious, and if matter is not very serious then this service can 
help on road by giving basic medical help. 

So to implement this system, we need to know some 
information about the incident like location that is token type, 
type of emergency that is quote type. So to store information 
different variables are used in the specification. Emergency is 
defined by composite type which has related information 
about single emergency, different other variables are also 
involved. All the variables are of global type so all can easily 
be accessible anywhere in the program, 
types 
id= token; 

emergencyLocation = token; 
emergencyType = <MEDICAL> | <FIRE>; 
isEmergency =token; 

Sensorin formation for anyMi see 11 aneous=<EME 
RGENCYTYPE> | <EMEGENCYLOCATION>; 

Location = token; 

Sensors= token; 

These are the variables, id is used for giving unique id to 
every emergency, emergency type is defined with quote type 
because we are considering two type of medical emergencies, 
one Boolean variable is defined isEmergency to check either 
is there emergency or not. One variable is for location of 


emergency and other are sensors that are related to traffic 
monitoring. 

Emergency: : 
eNumber: id 

location: set of emergencyLocation 
type: set of emergencyType 
inv mk._Emergency (-, 1, t) == (card (1) = 
card(t)) and (({<FIRE>} subset t \ 

{ <MEDICAL> } or { <MEDICAL> } subset t \ 

{ <FIRE> } ) and { <FIRE>, <MEDICAL> } inter 
t<> { } ) ; 

This is like an object of emergency which have all the 
related emergency information, in VDM SL it is called 
composite data type. It have also invariant which verifies its 
integrity and correct initialization, so no emergency should be 
call when there is no emergency. It is checking that if there is 
any emergency then there must be a location and it is also 
checking emergency must be from one of its type either 
medical or fire. 

The next part of specification defines the state of the 
system. State of the system refers to permanent data that must 
be stored by the system for its operation. State of the system 
must have some variables to store data, initialization and 
invariant conditions that must be satisfied to use this system. 
As in other programming languages we cannot declare two 
variables with same name, same in VDM SL we can’t do this, 
so we have to declare names with modification in the system 
state. In system state we have used loc for taking locations 
that are coming from sensors. Moreover System maintain 
sensors and also it have variable for current location if there 
are any emergency currently going on. 

state rescuee of 

loc: map Sensors to Location 
sensors : set of Sensors 
currentlocation : set of Location 
get in formation for anyMi see llaneous : map 
Sensors to 

Sensorin formation for anyMi see llaneous 
anyemergency : set of isEmergency 
listOf Emergencies : map id to Emergency 
init mk_rescuee , c, - , an , a) ==a= { | ->} and 
c= { } and an= { } 
end 

There are different operations involved in the emergency 
services that provide facilities to users when an emergency 
occur. 

SetSensorOnBridge is the first operation that is very 
simple, it takes a sensor and a locations. What it does is, it 
only place that sensor on that location for monitoring. Pre- 
condition checks that sensor must not already place anywhere 
else and post conditions finally does sensor placement if pre- 
conditions returns true. We are using set notation to keep 
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sensors and for assignment we are mapping sensors to their 
locations. 

SetSensor sOnBridge (sensorin : Sensors , 

locationln: Location) 

ext wr loc: map Sensors to Location 

wr sensors : set of Sensors 

pre sensorin not in set dom loc 

post loc= loc munion{ sensorin \ -> 

locationln }; 

Getlnformation operation gives us information about the 
location, where the sensor is deployed. Once this method is 
called, it will return us information of activities happening 
there so that information can be provided to rescue if there is 
any accident. Operation have pre-condition that verifies the 
information of this accident is not already added, and if it 
return true then post condition will execute and the 
information of the current accident will be added to system. 

GetlnformtionForRescue ( get in format ion in : S 
ensorin formation for anyMi seel laneous , sen so 
rin : Sensors) 

ext wr getin formation f oranyMiscellaneous : 
map Sensors to 

Sensorin formation for anyMi seel laneous 
pre (getin formationin not in set rng 
getin formation f oranyMiscellaneous) and ( sen 
sorin in set dom loc) 

post getin formation forany Mi see 11 aneous= 
getin formation f oranyMiscellaneous 
munion{ getin formationin \ ->sensorin } ; 

NewEmergency operation will create a new emergency; it 
needs some information for creating a new emergency object 
that is added to system in the list of emergencies. To create a 
new emergency it needs ID, location of emergency and 
emergency type. 

NewEmergency ( i din : id, location: emergencyLo 

cation, type: emergency Type) 

ext wr listOf Emergencies : map id to 

Emergency 

pre ( idln in set dom listOf Emergencies) 
post listOf Emergencies = 
listOf Emergencies munion {idln |-> 
mk_Emergency ( idln, {location} , {type}) }; 

GetEmergency will store emergencies from the list that 
system is maintaining, we need this operation. This operation 
returns the desired emergency object from the list on calling 
this operation and providing it an ID. This function maps that 
id on a list, extract those elements and return us. 

Ge tEmergen cy ( i din : id) 

ext rd listOf Emergencies : map id to 

Emergency 


pre true 

post listOf Emergencies = {idln} <: 

1 i stOf Emergen cies; 

NumberOfEmergences is a simple method that is returning 
the total number of emergencies added to system. This 
operation is working like a central information system that is 
maintaining how many emergencies are currently registered in 
system. System is maintaining variable called any emergency 
in a form of set. So to find how many variables in a set we use 
cardinality operator, this operator return us total number of 
elements in a set. 

NumberOfEmergences ( ) total : int 

ext rd any emergency : set of isEmergency 

pre true 

post total= card anyemergency ; 

B. Smart Sticker Module 

This is another module of traffic system. To identify or 
sense traffic situation on road we need to deploy wireless 
sensors on roads, but what sensors will sense? This is 
worldwide problem many counties tried different things to 
solve this problem. So we are also giving our idea of smart 
stickers. We use smart sticker that we will put on every 
vehicle on road that a sensor can sense, this process will take 
time but it worthwhile. 

First part of the specification is type, in which different 
variables are declared and defined so when we have to save 
something for system, we can use these. Smart sticker have 
String variable that can save sequence of characters. Smart 
sticker have Sticker variable as composite data type, it will 
contain all the information about the vehicle and its owner on 
which it is attached. System has different variables to save 
sensors data and sensors states also there are a variable to save 
location. 

types 

String = seq of char; 

Sticker :: carnumber : String 

Reg: token 

vehicleModel : String 
engineNumber : String; 

Sensorin formation = set of token; 

Sensors =set of token; 

Location =token; 

Smart sticker state define the whole system generally, 
what it will have while running and what its initialization 
condition is. So the system will have location of the under 
consideration area, some basic ownership information and a 
set of stickers. So the invariant and initialization condition 
validates when this system initialized it will not have any data 
in it. It is initialized empty and afterward we will fill 
information. 

state Smarts ticker of 
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loc : map Sensors to Location 

getin formation : map Sensors to 
Sens or in formation 

location : set of Location 

smartsticker ;set of Sticker 

inv mk_SmartSticker , 1 , s) == 1 <> {} 
and s <> { } 

ini t i==i=mk_SmartSticker ( { \ ->} , { \ - 

>},{},{}) 

end 

First operation of the smart sticker is to set sensors on the 
road, because we need sensors to sense smart sticker. We can 
give location and a sensor to deploy it on the road or 
anywhere in parking etc. precondition will check this sensor is 
not already in use and if it returns true then that sensor will be 
deployed on the road by taking union with the set that have 
already deployed sensors. 

operations 

SetSensorsonLocation (sensorin : Sensors , loc 
at ion In : Location ) 

ext wr loc : map Sensors to 

Location 

pre sensorin not in set dom loc 
post loc = loc munion{ loca tionln \ - 
>sensorin } ; 

To get information from the sensor we need this operation, 
we just have to give sensor identification from which we want 
to get information. Pre-condition verify that the sensor that is 
called belongs to our system sensors. Post condition gets the 
information from sensor and assigns it to variable so system 
can access this information. 

Getin formt ion for smart sticker ( sensorin : Sen 
sors) 

ext wr getin formation : map Sensors to 
Sensorin formation 
pre (sensorin in set dom loc) 
post getin formation= getin formation 
munion { getin formationin \ -> sensorin}; 

This method is quite useful because of its functionality; 
this method is used to add more stickers’ information into our 
system. Because vehicles are growing day by day and we have 
to add more vehicles to our database so we can sense those 
cars also and our system work correctly. 

addSmartSticker ( carnumberln : String, 
regin : token, vehiclemodelln : String 
, enginenumberln : String, 

locat ion In : Location , sensorin : Sensors , 
getin formationin : Sensorin formation ) 
ext wr smartsticker : set of Sticker 

wr location : set of Location 


wr getin formation : map Sensors to 
Sensorin formation 

pre locationln in set location and 
sensorin <> { } and getin formationin <> { } 
post smartsticker= smartsticker union 
{mk_S ticker (carnumberln , regin, 
vehiclemodelln , enginenumberln ) } ; 

This method reflects its functionality by name; compare 
smart sticker will compare a given sticker from our database 
and return true or false accordingly. It takes car number, 
registration no, engine number and vehicle model to query 
database. 

Compare Smart Sticker ( carnumberln : String, 
regin : token, 

vehiclemodelln : String, enginenumberln : Stri 
ng) query : bool 

ext wr smartsticker : set of Sticker 

pre true 

post if ( {mk_S ticker (carnumberln , 
regin, vehiclemodelln , enginenumberln) } 

<> smartsticker) then query = false 
else query = true 

C. Traffic System Module 

First part in the specification is types that are required in 
system design. In types we have defined different variables 
that we will use later to perform different operations and to 
verify different conditions. Traffic system have different 
variables like string to save some characters, signals for telling 
that signal can have one value from the defined three values. 
System state to save current system state so administrator can 
do different necessary tasks when required. It also have 
current traffic situation, different restricted areas are also 
defined so no one can go there or certain actions can be 
performed if someone illegally enter that area. Different type 
of sensor information is also defined, location to save location 
name and sensors are defined that are deployed on locations to 
get information. 

types 

String = seq of char; 

Signals = 

<RED> | <AMBER> | <GREEN>; 
systemStates = 

<SMOOTH> | <CRITICAL> | <BROKEN> | <UPGRADE>; 
trafficSitutions = 

< JAMMED > | <CONGESTION> | <SMOOTH> | <ACCIDENT>; 
restrictedAreas = 

<DCOHOUSE> | <GOVTOFFICE> | <GRIDSTATION>; 
sensorin formation = 

<TRAFFICSITUTION> | <RESTRICTEDAREAS> | <SYST 
EMSTATES> | <SIGNAL>; 

Second part is of values but we don’t have any predefined 
values in our traffic system so this area will remain empty. 
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values 

Third part is state of the system; it is important part 
because system will be functional if state runs successfully. It 
allows system to wake up under certain conditions. State also 
has different variables, we can call them system variables, and 
these variables are always in the system with some values that 
are required during runtime. So we have variables like 
restricted area, current traffic, signal, and location, all these 
variables are explained earlier. Get information is variable that 
will save current sensor information. In invariant system is 
checking current traffic status must be from the defined states, 
and signals must also be red, green or blue. In first location is 
empty. 

state trafficsystem of 

restrictedArea : set of 
restrictedAreas 

currentTraffic : set of 
traffics! tut ions 

signal: set of signals 
getin formation: set of 
sens or in formation 

inv mk_traffi csystem (-, C, S, L, -) == 

(C subset 

{< JAMMED>, <CONGESTION>, <SMOOTH>, <ACCIDENT 

>}) 

and ( S subset 

{ <RED>, <AMBER>, <GREEN> } ) and L <> { } 

end 

Next part of the specification includes operations or 
functionalities of the system. 

operations 

Get information from sensors is second method and as 
name suggests it will collect information from sensors. This 
method has one parameter that is sensor from which we want 
to collect information. Pre-condition validates information 
from the sensor not already save in the system variable, 
because we don’t want to save duplicate information and then 
finally post condition return us information. 

Getinformtionfromsensors ( getin format ion in 
: sensorin formation) 

ext wr getin formation: set of 
sensorin formation 
pre getin formationin not in set 
getin formation 

post getin formation= getin formation 
union{ getin formationin } ; 

D. Violation of Rules 

This operation takes three parameters and checks for 
violation, if there is any violation it will return true. This 
operation is necessary for making a challan if any vehicle is 
violating rule, it will make police representative work easy. 


ViolationOf Rules (carSpeed: int , currentLoca 
tion : restrictedAreas , signalcrossing : 
signals) 

ext rd restrictedArea: set of 
restrictedAreas 

pre true 

post ( carSpeed > 100 ) or 

( currentLocation in set 
restrictedArea) or 

(signalcrossing = <RED>) ; 

Next operation is update current traffic situation, it will 
take number of cars as parameter and return us current traffic 
situation in quote type. By looking at the current traffic 
situation any one can understand it and can choose best route. 
In this method post condition is checking temp traffic, it is 
either less than fifty or less then hundred or greater than 
hundred and deciding from smooth, congestion or jammed 
and update current traffic situation. 

UpdateCurrentTraf fic ( tempT r af f ic : int) 
ext wr currentTraffic: set of 
trafficSitutions 
pre true 
post 

( (tempTraffic >= 0 or tempTraffic <= 50 ) 
and currentTraffic ={<SMOOTH>}) or 
((tempTraffic >= 51 or tempTraffic <= 

100 ) and currentTraffic = 

{ <CONGESTION> } ) or 

((tempTraffic >= 101 ) and currentTraffic 
= { < JAMMED> } ) ; 

As we are updating traffic situation, same as we have to 
update traffic signal timings accordingly, otherwise increased 
traffic on road will lead to congestion or jammed and less 
traffic will lead to more unnecessary wait on traffic signals. 
So to update signal timing, this method is looking for current 
traffic situation and updating signal timings accordingly. 

UpdateTraf ficSignal (signal time : int) 
ext wr currentTraffic : set of 
trafficSitutions 

pre true 
post 

if currentTraffic ={<SMOOTH>} 
then signaltime=2 0 

elseif currentTraffic = { <CONGESTION> } 
then signaltime =10 
else signaltime =5; 

Medical emergency is the most critical situation on roads 
if there is any incident. So our system is also dealing with this 
situation, if sensors reports about any accident, system will 
initialize an emergency call to nearest medical centre so they 
can help the affected person. 
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emergencecall () emergency : bool 

ext wr currentTraffic : set of 
trafficSi tut ions 
pre true 

post if currentTraffic = { <ACCIDENT> } 
then emergency = true 
else emergency = false; 


E. Traffic Signals 

Signals are important component in traffic management 
system. The existing methods for traffic monitoring and 
management are not fully efficient in terms of cost, 
performance, support and maintenance. In this paper, the idea 
of more utilization and accurately managing the traffic lights 
presented. 

Formal Specification of traffic signal control system is 
formally described with the help of VDM-SL. Traffic signals 
are modelled with the help of wireless sensors network, 
because sensors are small devices which are used for getting 
and providing different type of information. All 
communication links are established through wireless sensor. 
Traffic signals are important for smooth traffic transaction. 
Intelligent sensors which is reasonable of vehicles which are 
passing through signals. 

In this paragraph we have described the types and different 
variables that are first part of formal specification for traffic 
signals. Signal is defined with quote type which is 
combination of three colors. Variable color is token type; 
number of car is integer type. 

types 

String = seq of char; 

numberOfCars = nat; 

Color = token; 

Signal = <RED> | <AMBER> | <GREEN>; 
Signalstatus = 

<RED> | <AMBER> | <GREEN> | <THRESHOLD>; 

Location = token; 

Sensors = token; 

Second part of the specification is values, either there are 
any predefined values we are using in our specification or not, 
if any we have to define those values here. So we have 
defined different threshold values for different traffic load that 
we will use in update signal method for checking that in 
which threshold current traffic relay and will update signals 
accordingly. 


values 



ZEROTHREDHOLD: 

nat = 

0; 

MINTHRESHOLD : 

nat = 

10; 

MIDTHRESHOLD : 

nat = 

O 

LO 


The most important part of specification is state of the 
system. State of the system permits to initialize and run the 
system under specific conditions if true, and it also preserve 
all the values during system running that are required for 
proper functionality of system. The functions needed for 


proper initialization are called inv and init, inv (invariant) 
check all the preconditions and init (initializer) initialize the 
system with given values. In our system invariant of the 
system determine that the number of cars should be non-zero 
and the status of signal should be from given signal colors. 
The color of signal changes according to predefined timings 
but with the passage of time and change in traffic flow will 
lead to update signal timings. Initially all the variables are 
empty and number of cars are given value of zero. 


state signalUpdate 

of 


loc : 

map 

Sensors to 

Location 



sensor : 

map 

Id to Sensors 

information : 

map 

Information to 

Sensors 



sensors : 

set 

of Sensors 

getin formation forsignal : set of 

Signalstatus 



signals : 

set 

of Signal 

n : 

numberOfCars 

inv mk signalUpdate (- , - 

S,N) == 

N >= 0 and 


(S = 

{ <RED> } or 


s = 

{ <AMBER> } or 


s = 

{ <GREEN> } ) 



init s == s = mk signal Upda te ({ | ->} , { \ - 

>}, { |->}, {},{}, 0,0 

end 

) 



Operations defines the main functionality of any system, 
so in this part different operations and their functionality is 
described 


operations 

Add Sensor is the first operation in which sensors can be 
placed at different locations. This operation takes two 
parameters; one is sensor which is to be placed and second is 
location where the sensors are to be placed. These sensors are 
specific which have ability to get the traffic condition at any 
time. Pre-condition checks that sensor must not be already 
placed anywhere and post condition finally does placement 
after pre-condition return true. 

Addsensors {sensor in: Sensors , locationln: 
Location) 

ext wr loc : map Sensors to Location 
wr sensors : set of Sensors 
pre sensorin not in set dom loc 
post loc= loc munion{ sensorin \ -> 
locationln }; 

Sensor Information is the second operation that is 
necessary to update signal timing and also everyone on system 
administrator side and user side want to know, what the 
current traffic situation is. So this method take only one input 
and give us one output, it take sensor id as an input and gave 
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us output or information from that sensor in return. This is 
very simple method but it is very useful. We are giving id to 
sensor map that returns us a sensor, that sensor further given 
to information map, that return us the information associated 
with that senor, and finally delivered to information Out 
variable. 

sensorsln formation (id :nat) infoOut : 
Information 

ext rd sensor: map Id to Sensors 

rd information : map Information to 
Sensors 

pre id in set dom sensor 

post infoOut = in formation (sensor (id ) ) ; 

Signal Update is important and critical operation in any 
traffic control system, all the system depends on the timing of 
the signals. If signals timings are not updated according to 
traffic situation on road, it will create problems that lead to 
road accidents and congestion, and if signals fail to work it 
will crash whole transportation system. So we have carefully 
designed all the conditions that update signal timings 
according to traffic situation on road. This method is using 
different variables for calculating new signal timing and path 
is the variable that saves number of roads. Second parameter 
is numberofcars that will give information about number of 
cars on each path and also some thresholds are used that are 
defined by system admin. Threshold gives us information 
about numbers of cars when we have to update our signal 
timing, in response each signal will be updated. First we are 
checking path, we have denoted ids with different roads. First 
id is verified and then checking of number of cars, if it cross 
the threshold then signal timing increase, if it below the 
threshold timing will be reduced else remain same. 

SigalUpdate (path : nat, numberOfCars : nat , 

ZEROTHRESHOLD: int, MIDTHRESHOLD: int, 

MINTHRESHOLD : int ) time :nat , signalO : Signal r 

signal 1 : Signal , signal 2 : Signal , 

signal 3 : Signal , other: Signal 

ext wr signals : set of Signal 

pre path >= 0 and path <=3 

post 

if path = 0 

then if (numberOfCars >= ZEROTHRESHOLD) 
and (numberOfCars <= MINTHRESHOLD) 
then (signalO = <GREEN> and 
other=< RED>) and time=20 and path=path+ 1 

mod 4 

else if numberOfCars > MINTHRESHOLD 
and numberOfCars <MIDTHRESHOLD 
then (signalO = <GREEN> and 
other=< RED>) and time = 20 and 
path=path+ 1 mod 4 

else (signalO = <GREEN> and 
other=< RED>) and time = 30 and 
path=path+ 1 mod 4 
else if path = 1 


then if (numberOfCars >= ZEROTHRESHOLD) 
and (numberOfCars <= MINTHRESHOLD) 
then (signall = <GREEN> and 
other=< RED>) and time= 20 and path=path+ 1 

mod 4 

else if numberOfCars > MINTHRESHOLD 
and numberOfCars <MIDTHRESHOLD 
then (signall = <GREEN> and 
other=< RED>) and time = 20 and 
path=path+ 1 mod 4 

else (signall = <GREEN> and 
other=< RED>) and time = 30 and 
path=path+ 1 mod 4 
else if path = 2 

then if (numberOfCars >= ZEROTHRESHOLD) 
and (numberOfCars <= MINTHRESHOLD) 
then (signal2 = <GREEN> and 
other=< RED>) and time=20 and path=path+ 1 

mod 4 

else if numberOfCars > MINTHRESHOLD 
and numberOfCars <MIDTHRESHOLD 
then (signal2 = <GREEN> and 
other=< RED>) and time = 20 and 
path=path+ 1 mod 4 

else (signal2 = <GREEN> and 
other=< RED>) and time = 30 and 
path=path+ 1 mod 4 
else 

if (numberOfCars >= ZEROTHRESHOLD) and 
(numberOfCars <= MINTHRESHOLD) 
then (signal3 = <GREEN> and 
other=< RED>) and time=20 and path=path+ 1 

mod 4 

else if numberOfCars > MINTHRESHOLD 
and numberOfCars <MIDTHRESHOLD 
then (signal3 = <GREEN> and 
other=< RED>) and time = 20 and 
path=path+ 1 mod 4 

else (signal3 = <GREEN> and 
other=< RED>) and time = 30 and 
path=path+ 1 mod 4; 

VII. Formal Analysis with Results 

We have formally verified two components of our model 
emergency services Rescue 1122 and Smart Sticker through 
VDM-SL. The analysis of formal specification was done with 
checking of syntax free and type free errors. The details of 
formal analysis is given in figure 4, figure 5, Table I, Table II, 
Table III and Table IV. 
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Figure 3: Formal analysis of Emergency Services through VDM-SL Figure 6: Analysis of Traffic Rules by VDM 


The model of smart sticker and their properties have 


verified with VDM-SL tool box. 



Figure 4: Analysis of Smart Sticker through VDM-SL 


Table I 

Analysis of Emergency Services operation 


Operation 

Syntax 

Check 

Type 

check 

Integrity 

Check 

SetSensorsOnB ridge 

Y 

Y 

Y 

GetlnformtionForRescue 

Y 

Y 

Y 

N e wEmer gency 

Y 

Y 

Y 

GetEmergency 

Y 

Y 

Y 

NumberOfEmergences 

Y 

Y 

Y 



Table II 

Analysis of Smart Sticker operation 


Operation 

Syntax 

Check 

Type 

Check 

Integrity 

Check 

SetSensorsonLocati 

on 

Y 

Y 

Y 

Getlnformtionforsm 

artsticker 

Y 

Y 

Y 

AddSmartSticker 

Y 

Y 

Y 

CompareSmartStick 

er 

Y 

Y 

Y 


Table III 

Analysis of Signals 


OPERATION 

Syntax 

Check 

Type 

Check 

Integrity 

check 

Addsenors 

Y 

Y 

Y 

Getinformation 

Y 

Y 

Y 

SignalUPDATED 

Y 

Y 

Y 
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Table IV 

Analysis of Traffic Rules 


OPERATION 

Syntax 

Check 

Type 

Check 

Integrity 

Check 

Getinformation 

Y 

Y 

Y 

ViolationOfRules 

Y 

Y 

y 

UpdateCurrentTraffic 

Y 

Y 

Y 

UpdateTrafficSignal 

Y 

Y 

Y 

Emergencecall 

Y 

Y 

Y 


VIII. Conclusion 

This paper presented Emergency Services Rescue 1122, 
Smart Sticker, Traffic Signals and Violation of rules of our 
proposed model. The goal of traffic management system is to 
identify vehicles, to inform drivers about traffic situation and 
road conditions. Traffic is monitored remotely through sensors 
on roads for providing different facilities to users Smart 
Sticker identifies the vehicles and informs the local authority 
to update the system about vehicles. Smart Sticker provides 
security in smart traffic management system to restrict illegal 
activities in traffic flow. Emergency Services Rescue 1122 
provides emergency related services to overcome different 
incidents like accidents, fire fighting etc. Traffic signals are 
configured for the purposes of smooth flow of traffic. 
Violation of rules ensures road safety and provides incidents 
free traffic flow. Violation of rules also detects intruders on 
roads not following traffic rules. Sensors are distributed and 
configured on roads for monitoring different activities of 
traffic to update the system. Formal methods based technique, 
i.e. VDM-SL is used to develop formal representation of the 
proposed model. The proposed model components are 
validated, verified and analysed using VDM-SL Toolbox. 

The proposed model consists of Smart Sticker, Emergency 
Services Rescue 1122, LED, signal control timing, alternative 
paths on specific threshold and miscellaneous activities of 
traffic flow. Smart algorithms can be proposed to control 
signal timing on specific threshold and make traffic incidents 
free. LED can be installed on different points on roads to 
provide information about traffic flow to users. In future this 
model can be enhanced to provide smart services in traffic 
management system in the country. 
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Abstract — In single cellular networks, the mobile stations cannot 
communicate directly with each other. All communications are 
relayed through the base stations. Such topology suffers from 
many limitations such as congestion problem when a large 
number of users are communicating in the same time to a base 
station. In this context, the device-to-device communications have 
been proposed to overcome the limitations of the conventional 
cellular architecture. Indeed, a mobile station can allow two 
nearby stations to communicate with each other without 
involving a base station. However, security becomes an important 
challenge that must be taken into consideration as the mobile 
stations participate in routing data between each other. In this 
paper, we propose a secure routing protocol for Multi-hop 
Cellular Networks (MCNs). Our goal is to discover a secure and 
short route between the source and the destination. To evaluate 
this proposed protocol, we perform some simulations using 
Network Simulator (NS-2). The simulation results show that it 
provides acceptable performance in terms of throughput and 
routing overhead as comparing with Secure Ad hoc On demand 
Distance Vector (SAODV). 

Keywords-component; single cellular networks, base stations, 
Device-to-device, secure routing protocol, MCNs, NS-2; 

I. Introduction 

In traditional cellular (single hop) networks, the mobile 
stations can communicate with each other only through the 
base station (BS). Such architecture suffers from many 
limitations such as signal attenuation especially when the 
station is at the edge of the cell. In order to overcome partially 
some problems, installing a high number of BSs is required. 
However, increasing the infrastructure of such networks is 
very expensive at management and deployment phases. Other 
emerging alternative called Multi-hop Cellular Networks 
(MCNs) is currently considered as a part of the five 
Generation (5G) network evolution. It includes the integration 
of cellular and Ad-Hoc technologies. So, the direct link 
between mobile nodes and BSs is not required [1]. This 
alternative has a lower implementation cost in comparison 
with adding new BSs. Many studies have showed the 
advantages and benefits of MCNs. Indeed, the coverage area is 
increased [2] and the interference is reduced as the radiated 
energy is diminished. Moreover, the transmission rate is 
increased [3] due the reduction of the signal loss in each node. 
As the signal covers a small distance, the energy consumption 
of each node is also reduced. To achieve such objectives of 
MCNs network, it is necessary to take into consideration some 
important technological challenges such as design and secure 
multi-hop routing protocols. 


Two cases of communication are distinguished in MCNs. The 
first case is when the source and the target destination are in 
the same cell. In this case, the packets are relayed by the 
mobile terminals and the intervention of the base station is not 
required. However, when the source and the destination are in 
different cells, the BSs have to participate in routing process. 
In MCNs the participation of mobile nodes in routing process, 
makes security as an important challenge that must be taken 
into consideration. Indeed, an attacker can trace the paths, 
infer the source and its target destination and so it can track 
other users. 

In this paper, we propose a new routing protocol for MCNs. 
This protocol selects a secure and short route that ensures 
security in terms of confidentiality, integrity and 
authentication. To prevent the anonymity of the users, we 
propose that they use temporary identities generated by 
themselves every session. Moreover, to minimize the 
computational overhead for a node in verifying the validity of 
node’s certificate, we use "Weil Pairing” scheme [4]. This 
scheme can help each node to authenticate implicitly its 
neighbor with minimum complexity. To secure the exchanged 
data between them, the source and the destination generate a 
session key initiated by the source node. 

The paper is organized as follows. In section II, we present 
some related works. In section III, we present some notations 
and preliminaries. In section IV, we describe our proposed 
protocol. To evaluate its performance some analysis and 
simulation results are given in section V. We conclude the 
paper with the conclusion and some proposed future works in 
section VI. 

II. RELATED WORK 

Many routing protocols are proposed for Ad Hoc networks. 
These protocols can be performed in MCN networks by the 
source when it is in the same cell as the destination. In [5], 
Zapata et al. proposed a secure routing protocol named 
SAODV. This protocol has the same steps as AODV protocol, 
but it integrates some cryptographic techniques to prevent 
routing messages against attacks. It use hash chain to protect 
hop count information and digital signature to authenticate the 
source and destination. The disadvantage of SAODV is the 
lack of authentication between neighbor nodes. This can leads 
to some attacks such as impersonation attack. 

In [6] Zhou et al. proposed an Optimal and Secure Routing 
Protocol (OSRP) for Multi-hop Wireless Networks. This 
proposed protocol satisfies security by selecting secure routes 
that are resilient to attacks. OSRP relies on a Trusted 
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Clearance Center (TCC). According to the report behaviors of 
the nodes, the TCC computes and assigns a trust value for 
each node based on game theory. The authors prove that the 
TCC can detect attackers and segregate them from the wireless 
networks. Therefore, the OSRP can select optimal routes with 
low false alarm rate and high detection rate. However, this 
protocol is vulnerable to several attacks such as impersonation 
attack due to the lack of authentication between the nodes. In 
[7], K. Sanzgiri et al proposed a secure routing protocol called 
ARAN. It ensures security in terms of authentication, integrity 
and non-repudiation. It is based on asymmetric key 
cryptography and it involves a trusted certification server. The 
certificate requires the address and public key of the node and 
a time-stamp of when the certificate is generated and when it 
is expired. The disadvantage of this protocol is the use of a 
central authority and it is not secure against wormhole attack. 
In [8], K.V. Kumar et al. proposed a secure routing protocol 
which consists of three steps. In the first step, each node 
exchanges a secret key with its one and two hop neighbors 
based on its public key. When establishing a shared key, the 
node participates in routing process in the second step. In the 
third step, the source and destination share a secret key 
securely and then data communication is triggered. The 
disadvantage of this protocol is that the shared secret key 
between nodes is based on public keys and so attacks are 
possible to occur. 

III. Notation and preliminaries 

A. Weil Pairing 

The proposed protocol is based on “Weil Pairing” scheme for 
cryptographic foundation. Assume that Gl is an additive 
cyclic group over an elliptic curve; G2 is a multiplicative 
cyclic group and these two groups are with prime order q. 
“Weil Pairing” is the mapping of e: G1 x G1 — ► G2, for all 
P,Q,R, S eGl,we have: 

e (P + Q ,R + S) = e (P,R) e (P, S) e (Q,R) e (Q, S) 

B. Smart- Chen- Kudla scheme 

The proposed protocol is based on “Smart-Chen-Kudla” 
scheme for key generation. This scheme has the advantage of 
providing implicit authentication. It is based on “Weil Pairing” 
tool. In this scheme, the trust party is in charge of generating 
and distributing public parameters (< q , Hi, P, P pu b, Gl, G2 ), 
where Hi is a hash function {0, 1}* — ► G7, P is a generator of 
Gl, Ppub is the master public key formed as P pu b = sP, where s 
^ Zq is the master private key of the trust party. The trust 
party registers each mobile node Mi and assigns to it a master 
private key P t = s Qi , where Qi = Hi( IDi) and /A is the 
identity of Mi. 

When two communicants A and B want to share a secret key, 
each one generates a random value a and b respectively. 

Key generation phase between A and B is performed as 
follows: 

• A sends Ta = aP to B, 

• B sends T B = bP to A, 

• A calculates its secret key as the following: 


Kab = H 2 (abP || e{sQ A , T B)e(Q B , asP )), 

• B calculates its secret key as the following: 

K ba = H 2 ( abP || e(sQ B , T A )e(Q A , bsP)). 

Both users A and B share the same secret key: 

K=K ab = Kba = H 2 (abP || e(bQ A +aQ B , sP )) 

Where, H 2 can be a random oracle or a secure hash function 

[9]. 

C. Adversary Model 

We assume that the attacker knows network protocol and 
functions but it does not know the secret keys and parameters. 
Also, it can observe and collect data packets to analyze the 
node behavior and reveal its hider identity to break the privacy 
of this node. The attacker can learn other important 
information from these collected data such as message 
content. It can use this information to launch other attack such 
as impersonation attack or Sybil attack. It can also modify the 
exchanged messages or broadcast a wrong message to 
influence the routing behavior. 

The attacks that can be launched by an attacker from the 
collected information are listed below: 

Attack replay : An attacker store a message without 

authorization and then it retransmits this information in order 
to trick the destination into unauthorized operations such as 
duplicate transaction or false authentication. 

Sybil attack: The attacker generates a set of unauthorized 
identities in order to establish a neighbor relationship. The 
other nodes do not know that these identities are issued from 
an attacker due to some security problem such as the lack of 
authentication between them. 

Rushing attack: In rushing attack, the attacker transmits the 
route request packet to a large number of nodes using a high 
transmission range. The receiver of this false packet may be 
unable to respond the sender, and so cannot establish the 
route. 

Impersonation attack: The attacker tries to impersonate the 
identity of a legitimate node in order to become a member of 
the selected route or to establish a neighbor relationship with 
other nodes. Therefore, the attacker can receive the routing 
messages directed to the faked nodes and so it can modify 
their contents to falsify the network. 

IV. PROPOSED ROUTING PROTOCOL 

In this section, we present the network model as well as the 
description of our proposed algorithm. . 

A. Network Model 

In our network model, we are based on Multi-hop Cellular 
Network that consists of a fixed BS and several numbers of 
mobile stations (see figure 1). Before deployment, each new 
mobile station must authenticate to the BS in order to obtain 
the system parameters ( q , Hi, P, P pu b, Gl, G2) necessary to 
generate secret keys shared with the neighbor nodes. Also, the 
BS furnished to each node a temporary identity used by this 
node in its first communication. In this network model, we 
assume that the time is divided into successive sessions. Also, 
the BS and mobile stations are synchronized, so we adopt 
IDMt to compute the private key and the temporary identity 
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for each mobile Mi in each session {t is the timestamp). For 
that purpose, each node must know the beginning of each new 
session to renew its private key and its temporary identity. 

The private key of a mobile Mi is computed as the following 
equation: Si = s Qi 

Where: Qi = Hi(IDMt) and IDi is the real identity of the 
mobile. 

In each session, the mobile have to perform a neighbor 
discovery phase in an authenticated way. During this phase, 
each node generates a random value ai and sends to its 
neighbor Pi value based on Pairing Discrete Logarithm 
Problem (PDLP): Pi = aiP. 



Figure 1: Topology of Multi-hop cellular network cell 
B. Proposed algorithm 

We propose a new anonymous on-demand routing protocol 
which achieves communication between mobile nodes without 
disclosing their real identities. Indeed, during route discovery 
process, the nodes use their temporary identities instead of 
their real identities. Indeed, when a node sends its real identity 
in plaintext a passive attack can be performed by the attacker 
by analyzing exchanged packets. Also, the disclosure of real 
identity makes the attacker able to trace packets backward to 
the source or forwarded to the destination. This is undesirable 
because in some case the source or the destination is a critical 
node. Moreover, when a node uses the same temporary 
identity for long time, it cannot be protected against many 
types of attacks, because this temporary identity can be 
analyzed the same way as its real identity. Thus, each node 
must use a dynamic identity in each session. These identities 
are used in data transmission after route selection. So, the 
route anonymity is achieved and the attacker cannot infer the 
participating mobile nodes in one session. 

To secure routing packets, we are based on a symmetric 
cryptography based on Weil Pairing tool. This scheme 
guarantees also an implicit authentication instead of verifying 
the validity of node’s certificate and involving a trusted 
certification server. This leads to minimize the expensive 
cryptographic mechanism in term of time and complexity. 

The source and destination share a secret key used to secure 
data transmission phase. This key is initiated by the source 
node during route request process since the source and its 


destination are not neighbor to exchange secret information 
( P s and Pd) in the neighbor discovery process (see Network 
Model section). 

Our proposed protocol is divided into two phases: route 
request (see figure 2) and route reply phases. 



Figure 2: Flow chart of route request phase 


1 ) Route request phase 

Route request phase is initiated when a source needs to 
communicate with a target destination for which it has no 
route maintained in its routing table. To secure route request 
packet, each mobile station Mi computes a session key shared 
with its neighbor M 2 based on Weil Pairing as the following 
equation: 

(1) K M im 2 = e(S h P 2 ) e(Q 2 , ai sP) 


= e( S 2 , Pi) e( Qi, (X 2 sP) - Km 2 mi 
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The route request phase is performed as the following steps: 

• Step 1: The source S generates and sends to each neighbor 
an RREQ packet: 

Step 1-1 : S generates a random value re Z q to compute a 
secret key shared with the destination D as the following 
equation: 

(2) K S D=e(rSs,Hi(ID D \\t)) 

Step 1-2 : S generates session key shared with each 
neighbor Mi as equation (1). 

Step 1-4 : S encrypts its temporary identity with Ksd shared 
with D to guarantee its anonymity and so the intermediate 
nodes cannot know who communicate to whom. 

Step 1-5 : S sends to each neighbor Mi the packets RREQs 
protected by Ksi. 

The format of RREQ packet is as follows: 

RREQ: {E (Ksu seq_num II TTL II Hop_count II r//;(IDs||0 
),E(K sd ,XID s ),H 2 (*)}. 

The source adds to RREQ packet the seq-num to prevent the 
route request against replay attack, TTL is used to limit the 
propagation area of the route request, Hop_count is the 
number of hops traversed by the route request and it is 
incremented by each hop, rHi(IDsWt) is used by D to compute 
the shared key with S, E(Ksd, XIDs) is the temporary identity 
of S encrypted by the secret key Ksd for anonymity and El 2 (*) 
is the hashed value of the RREQ packet. 

If S does not receive a route reply in a fixed time period, it 
sends a new RREQ packet. If it sends k packets and no 
response sent by D, it record that this destination is 
unreachable. 

• Step 2 : An intermediate mobile station Mj receives a 
RREQ packet from its neighbor M u . It performs the 
following subsets: 

Step 2-1 : Mj decrypts the RREQ packet by the shared key 
with M u . By this way, Mj authenticates M u because only 
this node can compute K ju , 

Step 2-2 : Mj checks the integrity of the RREQ by 
computing its hash value. If the verification passes, goes to 
step 2-3. Otherwise, discard this packet, 

Step 2-3 : Mj compute the K S j= e(rHi(ID s \\t), Sj)) in order 
to checks if it is the destination of this process. If Mj can 
decrypt XIDs , so it is the destination, goes to step 3. 
Otherwise, goes to step 2-4. 

Step 2-4 : Mj decrements the TTL value and increments the 
hop_count by one, 

Step 2-5 : Mj computes the new hash value of RREQ, 

Step 2-6 : Mj encrypts RREQs packets and sends it to the 
next neighbors, 

• Step 3: D receives RREQs packets. It performs the 

following subsets : 

Step 3-1: D decrypts the RREQs packets received through 
different routes and so authenticates each neighbor. 

Step 3-2: D selects the packet comes from the shortest 
route based on hop_count value. 


Step 3-3 : D compute Kds and decrypts the temporary 
identity of S by its private key. It maintains this key in its 
routing table to prevent the data exchanged with S after 
route selection. 

K DS =e(rH l (lDs\\t),S D ) 

= e( r //i(TDsl I t), sIh(ID D \\t)) 

= e(rsH x (\D s \t),H 1 (ID D \\t)) 

=e(rS s , H!(ID D \\t) 

=K sd 

Step 3-4 : D checks the integrity of this packet. If this 
verification passes, goes to Step 3-4. Otherwise, it discards 
this packet and selects the second one corresponding to the 
shortest route. 

Step 3-5 : D lunches the route reply phase. 

2 ) Route reply phase 

After selecting the RREQ with minimum hop_count, D 
performs the following Steps: 

• Step 1: D generates the corresponding RREP, 

Step 1-1 : D computes its temporary identity: 

XID d = Hi(ID D \\t) 

We assume that each node of the selected route compute a 
temporary identity used in data transmission after route 
selection to guarantee route anonymity. 

Step 1-3 : D computes the hash value of RREP packet, 

Step 1-4 : D sends the RREP encrypted by K D , t toward S 
through the reverse route. Where M t is its neighbor node of 
the reverse route. 

The format of RREP is as follows: 

RREP: { E (K d , t , XID d || seq_num\ H 2 (*) }, 

• Step 2: An intermediate node Mk receives a RREP packet 
from its previous neighbor. 

Step 2-1 : It decrypts the RREP packet by the key shared 
with the sender of the RREP, 

Step 2-2 : Checks the integrity of the RREP. 

Step 2-3 : Mk maintains the temporary identity of the 
previous node and add its new identity to RREP. 

Step 2-4: Mk re-computes the hash value of RREP packet, 
Step 2-5 : Mk encrypts RREP and sends it to the next node 
of the reverse route. 

• Step 3: S receives the RREP. It decrypts it and verifies its 
integrity. Then, it uses the corresponding route to 
communicate with D. The data transmission phase is 
protected by the session key shared between S and D. 

V. security Analysis and evaluation of our proposed 

PROTOCOL 

A. Security analysis 

Our proposed protocol acheives the following security 
requirement: 

Confidentiality: The routing packets are protected by the 
secret keys shared between neighboring nodes. These keys are 
generated using the Weil Pairing scheme based on several 
secret parameters such as the private key of the BS. An 
attacker has to solve the PDLP to find the secret key of the BS 
and compute the shared keys. Also, the source and the 
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destination share a session key to encrypt data transmitted 
after route selection. 

Integrity: In the proposed protocol, each transmitted packet is 
concatenated with its hash value to provide integrity. To 
modify a packet, an attacker must decrypt its content and then 
computes the corresponding hash value of this modified 
packet. However, the decryption of such packet needs to learn 
the secret key shared with the sender. The shared keys and the 
private keys are generated using “Weil Pairing” scheme. 

By this way, the attacker cannot learn the correct keys to 
generate the corresponding hash value. So, the proposed 
protocol ensures the integrity of transmitted packets. 
Authentication: In our proposed protocol, we are based on 
Weil Pairing scheme for key generation. This scheme has the 
advantage to provide implicit authentication based on several 
secret parameters. 

Anonymity and Intractability: In the proposed protocol, 
each node participates in the route establishment phase using 
its on-time temporary identity. Based on this identity, an 
attacker is not able to reveal the corresponding real identity. 
Also, our anonymous routing protocol does not reveal the 
information related to the source, destination and the 
intermediate nodes. Even, these nodes cannot learn with which 
node the source communicates. An attacker is not able to trace 
the RREQ packet based on its common parts to discover the 
source and the destination nodes. In fact the RREQ is 
encrypted by the shared keys of the intermediate nodes. So, 
our proposed protocol provides anonymity and intractability. 
Key secrecy: The proposed protocol ensures perfect forward 
secrecy because when an attacker compromises the secret keys 
of all nodes, it cannot reveal the previous session keys. This is 
because each session key relies on random values. 

Our proposed protocol is secured against the following 
attacks: 

Replay attack: This attack cannot be realized in our proposed 
protocol because each session key is relies on random values 
generated by the two neighbor nodes. So, the new keys will be 
generated without any links with the previous session keys. 
Also, using the timestamp in computing a new temporary 
identity and a sequence number for each new packet prevent 
our proposed protocol against replay attack. 

Sybil attack: In our proposed protocol, the BS assigns to each 
node a private key as function as its identity and the private 
key s. So, to perform the Sybil attack, an adversary has to 
generate a private key for its false identity. This is not possible 
for an attacker because it must resolve PDLP problem in order 
to learn the private key s of the BS. So, the Sybil attack is not 
possible in our proposed protocol. 

Rushing attack: In our proposed protocol, when a node 
receives a request packet, it authenticates the sender of this 
packet by verifying the encryption key. If the authentication is 
performed successfully, the node accepts the packet and 
responds the sender because only a legitimate node can 
generate a valid key. Otherwise, it drops the received packets. 
So, if an attacker forwards a packet using a large transmission 
range, this packet will not be accepted by the receiver. 


Therefore, the rushing attack cannot be realized in our 
proposed protocol. 

Impersonation attack: In our proposed protocol, to perform 
this attack, an attacker must generate a secret key shared with 
the nodes to which it will send the message. However, this 
attacker is not able to solve the PDLP problem to learn the 
secret key of BS and computes a valid private key 
corresponding to this impersonated identity. Also, it is 
infeasible to learn the real identity of a legal node and 
compute its private key because the request packet does not 
contain a real identity of any node. Thus, the attacker fails to 
impersonate another legitimate node. 

B. Simulation results 

To evaluate the proposed protocol, a set of extensive 
simulations is conducted using Network Simulator (NS-2) 
[10]. We compare it with SAODV protocol. The 
implementation of the security techniques is performed by 
applying the Crypto++ library because NS-2 does not support 
these functions. 

The parameters of the simulation are summarized in Table I. 


Table I: Parameters for simulation evaluation 


Parameters 

Value 

Routing Protocol 

SAODV, Proposed protocol 

Traffic Type 

Constant Bit Rate (CBR) 

Simulation duration 

100 seconds 

Packet Size 

512 bytes 

Simulation Area 

500 m x 500 m 

Total number of mobile 
stations 

20 

Number of malicious 
nodes 

5 

Queue length 

50 packets 

MAC protocol 

MAC/802.11 

Mobility model 

Two Ray ground 


The impact of the presence of malicious nodes is measure 


based on the following metrics: 

^ Throughput: is the average of data received by the 
destination among the total number of data delivered by 
the source during the simulation time. 

^ Routing overhead (packets): is the number of routing 

packets needed to send in order to deliver the data packets 
from the source to the destination. 
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Figure 3: Overhead versus number of source node 
Figure 3 represents the routing overhead versus the number of 
source node. We note that, the value of the overhead increases 
when the number of source nodes increase for both SAODV 
and the proposed protocol in the presence of five malicious 
nodes. However, in our proposed protocol the overhead is less 
than SAODV because each node sends the routing packets 
only to the neighbor nodes which share with it a secret key. In 
SAODV, the route request packets can be sent by the attackers 
as no authentication between nodes is guarantee. 



Figure 4: Throughput versus number of source 
As show Figure 4, the throughput increases for both the 
SAODV and proposed protocol when the number of source 
node increases. However, our proposed protocol has a higher 
value in comparison with the SAODV. This is because during 
route discovery process the nodes authenticate each other and 
so the probability that an attacker becomes a member of the 
selected route to drop data packet is low. This is not 
guaranteed in the SAODV because the authentication is 
performed only between the source and the destination. 


VI. CONCLUSION 

In Multi-hop Cellular Networks, the implementation of a 
reliable routing protocol which ensures both the network 
performance and the security requirement is a challenging 
task. In this paper, we propose a secure routing protocol which 
selects the shortest path between a source and its target 
destination. This proposed protocol provides performance both 
in term of security requirements and network performance. In 
the protocol design, we aim to fit inexpensive cryptographic 
mechanism in each phase in order to make it robust against 
attacks. 

In the future work, we plan to extend this protocol to be used 
when the source and the destination exist in different cells and 
so other types of relay will be considered in routing the 
exchanged packets. 
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ABSTRACT 

Investigation on privacy preserving data mining is in extensive 
need to the present day technological situation. Storage of the 
data and its usage through various computational processes is 
becoming very easy and efficient. At the other end the primary 
concern or sometimes can be termed as limitation to this 
extensive data analysis is privacy. There are existing privacy 
preserving techniques that solve this problem and also guarantee 
privacy as well as data utility. But these techniques have to be 
updated in parallel to the expansion of digital technology. In view 
of this, the part of research in this paper analyses various 
normalization techniques with heterogeneous data distortion. The 
experimental consideration is done with the comparison of 
various statistical measures on the distorted data and their 
preservation with respect to the original data. We evaluated the 
performance of heterogeneous data distortion with three types of 
transformations namely Min-Max Normalization, Z- Score 
Normalization and Decimal Scaling. The performance is 
evaluated with various data distortion measures and privacy 
measures. 

Key words: 

Privacy Preserving Data Mining (PPDM), Data Normalization, 
Privacy, Data utility. 

1. Introduction 

Every user of technology can notice the raise in data 
collection, storage and its usage. This information 
explosion enables us to extract valid useful patterns using 
various computational techniques. Data mining is a set of 
well defined techniques can be applied on these massive 
amounts of data and can deduce valid hidden patterns [1] 
[2]. Marketing, Sales, Finance, Insurance, Weather, 
Banking and Health sector etc., are various application 
fields of data mining. Data sharing and processing among 
various organizations is an important phenomenon now 
days with the rapid advance in internet and communication 
technology. 

Privacy stands as a main threat at the apathetic 
side of this aggressive data mining. The data owners who 
stay at the low end in the data sharing scenario are 


becoming victims of this privacy violation. However, we 
cannot ignore the obvious data sharing and its processing. 
But one has to take the considerable safety measures to 
protect the data privacy. 

Privacy preserving data mining is one such 
prominent area used to prevent the data disclosure. Its 
main focus is that ‘do the data mining in a privacy 
preserving manner’ [3] [4]. Several privacy preserving 
techniques have been proposed and used in various 
applications. However, all these techniques follow one 
level of privacy for all data. In real time it is not applicable 
because, privacy is an individual choice of data disclosure. 
In this paper we propose an experimental study of various 
normalization techniques with heterogeneous data 
distortion. 

The rest of the paper is organized as follows. 
Section 2 provides related literature in privacy preserving 
data mining, section 3 elaborates the proposed work with 
the flow chart. Experimental study is tabulated and also 
graphically represented in section 4. Section 5 gives the 
conclusion of the work. 

2. Related Literature 

Several methods have been proposed in the area of privacy 
preserving data mining to come across the problems 
caused by excessive data mining and to protect the 
privacy. The primary classification of this includes 
anonymization, perturbation and cryptographic techniques. 
Anonymization is a process of de- identifying the original 
data with semantically meaningful substitution. Various 
techniques K- Anonymity [5] [6], L-diversity [7], t- 
closeness [8], comes under this category. In the 
perturbation approach the data is being modified by 
including noise component [9] [10]. In [10] random data 
perturbation technique is proposed and they have also 
shown the construction of original data distribution and 
building a data model with acceptable deviation. The 
existing perturbation techniques follow one-size-fits-all 
approach which is relatively inflexible. To enhance the 
existing perturbation methods another work proposed in 
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[11] has performed the perturbation at two different levels 
with different intervals. 

Privacy preserving techniques can be proposed in 
accordance with the data mining technique being applied. 
Privacy preserving data clustering by data transformation 
is implemented in [12] [13]. K-means clustering on 
vertically partitioned data in a privacy preserving manner 
is proposed in [14]. Privacy preserving data classification 
is studied in [15]. First the data transformation is 
performed and then the classification model is built on the 
distorted data. Its accuracy is measured with distorted data 
against original data. Privacy preserving association rule 
mining is implemented in [16]. These methods illustrate 
the procedure of hiding the sensitive rules by decreasing 
association of the extracted rules. 

This part of work mainly aims at to study the 
effectiveness of data modification through perturbation 
techniques. A proper evaluation of perturbation techniques 
shows a better performance than other categories in terms 
of accuracy and computational time. The work proposed in 
[11] has motivated us to perturb the data values in a 
heterogeneous manner. The quality of data distortion is 
measured in terms of various utility and privacy measures 
[17] [18]. Privacy gain and information loss are two 
important factors have to be maintained with a proper 
balance. The roles involved in the data sharing scenario 
can fix their thresholds for data utility and privacy levels. 
This work performs the data distortion with three different 
transformation techniques and evaluates their performance 
with various measures. In addition to the normal way of 
data transformation the heterogeneity component is also 
introduced. 


3. Proposed Work 

In this section we illustrate our work with the 
flowchart given in figure 1. One privacy level fits for all 
approach is not suitable for better privacy protection and 
data utility. In this regard we proposed a new 
heterogeneous data distortion in our earlier work [19] [20]. 
Our earlier work has categorized the data into three 
various classes namely High, Medium and Low. In order 
to perform this categorization we proposed a different 
privacy analysis approach. In that process it uses the 
privacy preference of the owner, privacy decision of the 
collector and existing correlations. Using these validations, 
the data could be mapped to any of the available classes. 
Then accordingly the perturbation with various threshold 
levels is introduced. We recommend the readers to go 
through our earlier work proposed in [19] to know more 
about this privacy mapping. 

As a part of this research, after the data mapping to 
privacy classes various normalization techniques are 
applied to perform the data transformation. We have 
evaluated the performance of heterogeneous data 
distortion with Min-Max normalization, Z- Score 
normalization and decimal scaling. A scaling factor known 
as privacy threshold is also added accordingly to each of 
the class. After the data gets transformed it has to preserve 
some properties to prove its effectiveness in data utility 
and privacy protection with an acceptable deviation. In 
order to do this various statistical measures are evaluated 
on transformed data against original data 



Figure 1 Flow Chart of Proposed Work 
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3.1 Min-Max Normalization 


Min-Max normalization transforms the attribute 
data to fit into a specific range. 

For example a value A to B which fits in the range 
[C, D] 

„ A— minimum value of A , „ _ 

B = * (D — C ) + L 

maximum value of A-Minimum value of A 

Equation 1 


3.2 Z-Score Normalization 

The Z-score normalization linearly transforms the 
data in such a way that the mean value of the 
transformed data equals to 0 and their standard deviation 
is equal to 1 . 

The values for an attribute, A, are normalized based on 
mean \i A and o A 

A Value, x, of A is normalized to x' using the 
following transformation formula 

v , _ X-\i 

X — Equation 2 

a 

Where X' — tranformed data 

X — data to be tranformed 
g — Mean of attribute 
o — Standard Deviation of attribute 


3.3 Decimal Scaling 

Decimal scaling transforms the data into a range [- 
1, 1]. It normalizes by moving the decimal point of 
values of attribute A. 

The transformation formula is 


V' = 


v 

10 j 


.Equation 3 


where j is the smallest integer such that Max (|v'|) < 1 . 
The normalization process usually changes the original 
data quite a bit and we can choose the desired 
transformation technique depending upon the acceptable 
deviation. 

3.4 Statistical Measures 

Every data modification process has to be 
evaluated carefully. The drastic change may negatively 
affect the data utility and less change will give the same 
on privacy. Hence both the properties have to be 
preserved in a balancing manner. The following 
properties shown in table [1] are used to perform this 
evaluation. 


Table 1: List of Measures 


Distortion Measure 

Equation 

Mean 

II 

>|M 

Standard Deviation 

o= lx *-* 2 

\ N 

Signal to Noise Ratio 

(i 

SNR = - 
o 

Mean Square Error 

MSE^Ef^-ri 2 

Mean Absolute Error 

MAE= Jjl,i =1 \x-n\ 

Utility Measure 

Equation 

Information Loss 

(N-O) / (U-L) 


3.5 Information Loss Metric (IL) 

A new metric is proposed to measure the information 
loss. The basic idea is drawn from the metric proposed 
in [21]. In the proposed work the data distortion is 
performed at various classes, hence a variant of existing 
metric is imposed to measure the information loss in 
each of the privacy class. 

IL class = ( N h~ X h ) / (Uh ~ L h) ••• Equation 4 

N h — New Distorted Data X h — Original Data 

U h - Max Value in Class h , L h Min Value in class h 

The amount of data distortion can be performed on the 
basis of information loss metric value. If the 
IL attribute measure returns a ‘O’ then it means that there 
is no distortion and if it is ‘ 1’ it implies full distortion. 
The administrator has to take proper decision to fix this 
parameter and thus decides to what extent the data utility 
and privacy can gets compromised. This measure 
hopefully helps us to provide a balancing factor between 
the data utility and privacy. 


1.2 


1 

0.8 

0.6 

0.4 


minimi 


1 2 3 4 5 6 7 8 9 1011 



■ Information 
Loss 

■ Data Utility 


Figure 2 Information Loss Ys Data Utility 
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4. Experimental Consideration 

This section gives the detailed result analysis. Our 
experiment is performed on three different data sets given in 
table 2. The illustration with sample data is given in table 3. 


Table 2: Data Set Description 


Dataset 

Attributes 

Instances 

Classes 

Adult Data Set 

14 

32561 

2 

Liver Data Set 

7 

345 

2 

Income Data 
set 

9 

100 

- 


Table 3: Sample Data 


Original 

Data 

Data with 
Min-Max 

Data with 
Z-Score 

Data with 
Decimal 
Scaling 

60 

61.66667 

61.71636 

67 

76 

78.73333 

78.45224 

84.6 

20 

18.5 

21.33601 

22.5 

75 

77.66667 

77.40625 

83.5 

25 

23.83333 

26.68612 

28 

76 

78.73333 

78.45224 

84.6 

76 

78.73333 

78.45224 

84.6 

76 

78.73333 

78.45224 

84.6 

36 

35.56667 

38.45636 

40.1 

76 

78.73333 

78.45224 

84.6 

76 

78.73333 

78.45224 

84.6 

45 

45.16667 

48.08655 

50 

35 

34.5 

37.38634 

39 

66 

68.06667 

67.99232 

73.6 

76 

78.73333 

78.45224 

84.6 


Result analysis is carried on three different transformations 
and finally checked with various measures. In this work we 
proposed a new information loss metric for evaluating the 
amount of data reduction. The administrator can check the 
value and accordingly can fix the threshold parameter. 


Table 4: Metric Evaluation 


Adult Data 
Set 

Original 

Data 

Data with 
Min-Max 

Data with 
Z-Score 

Data with 
Decimal 
Scaling 

Mean 

59.6 

61.0733 

61.88 

66.39333 

Standard 

Deviation 

21.2159 

22.86105 

21.41798 

23.56824 

Signal to 
Noise Ratio 

2.80921 

2.6715 

2.88926 

2.817 

Mean 

Square 

Error 

420.1066 

487.785 

428.1477 

518.43129 

Mea 

Absolute 

Error 

18.266 

19.70678 

18.35003 

20.31567 

Sum of 

squared 

error 

6301.6 

7316.786 

6422.21549 

777.46932 



Figure 3 Comparison Plot for MAE 


Mean Square Error 

600 
500 
400 
300 
200 
100 
0 

Original Min-Max Z-Score Decimal 

Figure 4 Comparison Plot for MSE 

The graphical representation given above on three 
transformations proved that our data preordering technique 
shown desirable performance with respect to all metrics 
with different amounts of deviations. The deviation rate is 
high in decimal scaling followed by Min-Max followed by 
Z-score. 



Table 5: Comparison of original and perturbed data sets on Adult 
Data Set 


Adult Data 
Set 

Original 

Data 

Data with 
Min-Max 

Data with 
Z-Score 

Data with 
Decimal 
Scaling 

Mean 

38.57689 

38.46562 

40.57101 

43.14101 

Standard 

Deviation 

13.6377 

14.58908 

14.28473 

15.20555 

Signal to 
Noise Ratio 

2.82869 

2.6366 

2.84017 

2.83719 

Mean 

Square 

Error 

185.9812 

212.8346 

204.04737 

231.20159 

Mea 

Absolute 

Error 

11.18829 

12.02336 

11.66317 

12.52910 

Sum of 

squared 

error 

6041785.34 

982 

6914144.6936 

3 

6628682.7959 

1 

7510814.7163 

3 
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Table 6: Comparison of original and perturbed data sets on Liver 
Data Set 


Mean Square Error 


Liver Data 
set 

Original 

Data 

Data with 
Min-Max 

Data with 
Z-Score 

Data with 
Decimal 
Scaling 

Mean 

44.74614 

46.39698 

47.04387 

49.92401 

Standard 

Deviation 

16.18983 

16.82336 

16.5242 

18.06575 

Signal to 

Noise Ratio 

2.76384 

2.75789 

2.84697 

2.76346 

Mean Square 
Error 

261.66111 

282.53992 

272.58098 

325.81152 

Mean 

Absolute 

Error 

13.31455 

13.93772 

13.34759 

14.95953 

Sum of 

squared 

Error 

152548.42596 

164720.77071 

158914.71141 

189948.11821 


Table 7: Comparison of original and perturbed data sets on INCOME 
Data Set 


INCOME 
Data set 

Original 

Data 

Data with 
Min-Max 

Data with 
Z-Score 

Data with 
Decimal 
Scaling 

Mean 

58.57576 

59.9202 

60.94132 

65.20606 

Standard 

Deviation 

17.90627 

19.36656 

17.94544 

19.96342 

Signal to 

Noise Ratio 

3.27124 

3.094 

3.39592 

3.26628 

Mean 

Square Error 

317.39578 

371.27505 

318.78573 

394.51239 

Mean 

Absolute 

Error 

15.75814 

17.07966 

15.63114 

17.60491 

Sum of 

squared 

Error 

31422.18186 

36756.23034 

31559.78681 

39056.72634 



Figure 5 Graphical Representation on Mean 
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Figure 6 Graphical Representation on Mean Square Error 


Table 8: Information Loss on Adult Dataset 


Feature 

Privacy Class 

Information Loss 

AGE 

Class 1 

0.117647 

Class 2 

0.185185 


HIGH 

1 

Hour-per-Week 

Class 1 

0.3 

Class 2 

0.102564 


Information Loss 


1.2 

1 

0.8 


0.6 

0.4 

0.2 

0 


C ass : 


1 Class 


2 HIGH C 
AGE 


t 


ass 


IClass ; 


Hour-per- 

Week 


■ Information 
Loss 


Figure 7 Graphical Representation on Information Loss on Adult data 
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3. Conclusion 

In this paper we proposed a comparative analysis of various 
data transformation techniques with heterogeneous data 
distortion method. The techniques namely Min-Max, Z- 
Score, and decimal scaling have been applied to transform 
the data for privacy protection. These normalizations are 
applied at various privacy classes. The distorted data is 
evaluated against various distortion measures and privacy 
measures. A new privacy measure is implemented to 
measure the level of data distortion in each of the privacy 
class. The administrator can take the decision on amount of 
noise to be added depending upon the information loss 
metric. All the three transformation techniques have 
performed in accordance to the data perturbation with 
different data deviation rates. We conclude that our 
proposed method for data categorization into various 
privacy classes can be adoptable to any distortion and thus 
enhances the privacy protection. Results have shown that 
proposed heterogeneous data perturbation provides better 
privacy. 
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Abstract — there is a correlation between pixels in each image so 
that each pixel value of adjacent pixels can be guessed. By 
removing these dependencies can be compressed images. Our 
goal is to reduce the amount of compressed image data needed to 
display the digital images and therefore reduce the cost of 
transmission and storage. Compression has a key role in many 
important applications. These applications include image 
database, transmission of images, remote sensing, medical 
imaging, military and space equipment remote control and so on. 
In addition to the compression, image coding, there's talk. That 
after quantization matrix should be coded range of conversions. 
In reconstruction after decoding to achieve our desired image 
obtained with the difference that the picture is far less than the 
original image. What we've done in this thesis using a fractal 
method utilizes a Kohonen neural networks and clustering to 
increase the compression ratio and reduction coding and 
decoding the image. We have implemented three methods based 
on fractal coding. The first method is simple fractal coding. In the 
second method to create the codebook of multiple tree fractal 
coding is used. In the second method of vector quantization LBG 
algorithm for Kohonen neural network-based clustering 
algorithm and code book for coding image is used. Results in the 
second method show faster encoding. The method is simple 
fractal compression rate is higher than other methods. 

Keyword: image compression; clustering; vector quantization 


I. Introduction 

Image compression is the application of data compression 
on digital images. In effect, the objective is to reduce 
redundancy of the image data in order to be able to store or 
transmit data in an efficient form [1]. Data compression has 
become requirement for most applications in different areas 
such as computer science, Information technology, 
communications, medicine etc. In computer science, Data 
compression is defined as the science or the art of 
representing information in a compact form [2]. 

Image processing systems can compress raw images with 
the resolution and quality desired, Nowadays and thus to 
achieve different levels of compression as applicable. The 
compression ratio can be changed depending on the intended 
application. This need has always existed. For example, it is 
possible to transmit real-time images and video via packet- 
switched networks. 

High quality images are obtained from the studio, medical 
images, satellite imagery and scanned images of manuscripts in 
order to preserve historical monuments. 


They need to compress because of the immense size for 
storage, maintenance, as well as sending via communications 
equipment with limited bandwidth. For this purpose, the 
resolution and bit rate compressed images decrease as much as 
the eye does not recognize. 

Image compression, is processing that eliminates additional 
information, reduces data into digital signals. 

This process depending on the bandwidth required for data 
transmission and storage, to reduce additional information. 
Reducing the bandwidth required to transfer more data at the 
same time. 

Compression methods remove duplicate information in the 
image and with suitable encoding techniques reduce the size of 
image files. Dhawan et al. (201 1) reviewed and discussed about 
the image compression, need of image compression, its 
principles, classes and various algorithm of image 
compression. They mainly worked over gray scale images of 
Lena and finger print images [3]. 

Compression methods can be divided into two groups. 
Lossy compression and lossless compression. 

Lossless compression tries to have the least amount of 
change in image quality 

However, in lossy compression due to the loss of parts of 
the picture finally compressed image will be lower quality. So, 
according to intended use can be determined compression type. 

In this paper compression based on clustering that the 
purpose of this paper is that the images are divided into several 
groups, and in this division images, different groups should be 
as different as possible together and images contained in a 
group must be very similar. 

Clustering method is used to analyze the data relating to 
distances of between the border and the center of gravity and 
angles between the distances of the images. 

K-means is one of the most used clustering algorithms 
which word of K in the name of the algorithm refers to the fact 
that the purpose of this algorithm is to find a fixed number of 
clusters 

According to the data points is near which is based on 
image compression using the k-means algorithm is 
implemented. The algorithm has two major drawbacks: First, 
algorithm is depend on the initial values of the number of the 
cluster and capitals clusters Second, it is easily trapped in 
local optimum and therefore does not generate the optimal 
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solution. The method of fractal is another method which used 
features of self- similarity in the image. 

The advantage of this method is a high compression ratio, 
good mathematical structure for speed coding and decoding. 
The drawback of the Fractal method is that the coding time in 
this method is high.In this paper, vector quantization coding of 
a clustering algorithm based on Kohonen self-organizing map 
is used to improve the image.The simulation results on the 
images, shows that this method of coding time reduced, no 
significant change in the reconstructed image quality and 
compression ratio. Simulation in this paper is done using 
MATLAB software. Using clustering enables us instead of 
processing massive amounts of images, only image review and 
analysis that are very similar and this is a big step for 
simplifying the problem. 

the processing is only reducing the coefficients of each 
block DCT 8*8, have proposed according to the objective 
function to minimize energy consumption and maximize the 
lifetime of the system, and states image quality [Abou-Chadi 
,2011 [ 2 ]. 

Analysis of each component of DCT blocks show that 
image quality and power consumption are two opposite effects, 
the image is segmented regions of interest, and is expressed to 
different parts of different sizes [4]. 

II. Materials and methods 

In this paper, three-step simulation done and these steps are 
separately . The reason for this was to do a comparison 
between the proposed methods with other methods. Initially, 
fractal coding method is implemented simple. In the next step, 
method of fractal coding has been implemented along with 
algorithm multiple tree. In last method, it was used Fractal 
coding algorithms with Kohonen self-organizing neural 
networks based on clustering algorithm implementation.In 
terms of compression is carried out two operations of coding to 
compress and decoding compressed image reconstruction. The 
more we can raise the compression ratio so that the difference 
between the original image and the coded image is low (better 
image quality) results will be satisfactory. 

Since the method of Fractal coding used to Image 
Compression, because of the high coding, Fractal coding 
method makes have more computational cost.The reason for 
this is that the coding process used for each block, the block 
area searches for the best ambiguous and finally some 
parameters of similarity between good block board and similar 
domain block is used to encrypt blocks. 

In the first stage of book, code can be built where each 
block book, code displayed an index and in the second stage 
takes place coding, it means which each image block 
corresponding with a book-index the code. Continue to 
increase time of coding were discussed Kohonen network 
clustering. 

Implementation of this process is used of a laptop with core 
i7 processor with RAM 4 GB. Implementations were done 
Version 2014 in MATLAB environment. 


III. The study of implementation results 

Initially, the results of simple fractal coding algorithm have 
examined. As well as seen in Figure 1-4, image is used of a 
size of 255 x 258 for image compression. 

As expected, coding and decoding time algorithm for 
operation is fairly long. In addition, the compression rate is 
higher. 



Original image Reconstructed image 


Figure 1. Output from a simple fractal coding method 

Running time algorithm take 1046 seconds which is slightly 
more running time and for the on-line cases can not be a good 
algorithm. As well as, the of PSNR values is obtained 27.4698 
by this method. PSNR value of this method compared to other 
methods proposed in other research show that this method has 
better rate of compression. 



Original image Reconstructed image 


Figure 2. Output from a simple fractal coding method with 
a running time algorithm of 540 seconds. 

Running time algorithm take 540 seconds which is slightly 
more running time and for the on-line cases can not be a good 
algorithm. As well as, the of PSNR values is obtained 34.18 by 
this method. 
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Figure 5. Output from a fractal coding method and multiple 
tree algorithms with a running time algorithm of 220 seconds 


Original image Reconstructed image 

Figure 3. Output from a simple fractal coding method with 
a running time algorithm of 240 seconds. 

Running time algorithm take 240 seconds which is slightly 
more running time and for the on-line cases can not be a good 
algorithm. As well as, the of PSNR values is obtained 34.18 by 
this method. It used multiple trees to solve of problem of a 
method of simple fractal coding. It is used to the image 
segmentation and creating a block for coding. This 
segmentation is important which is presented simple fractal 
coding method using small blocks with a lower compression 
ratio, although larger block sizes increase the compression rate 
and makes the reconstructed image is of lower quality. 



Original image Reconstructed image 

Figure 4. Output from a fractal coding method and multiple 
tree algorithms. 

Running time algorithm take 44 seconds which shows 
change quite dramatically compared with fractal coding 
algorithm. As well as, the of PSNR values is obtained 25 by 
this method. PSNR value is slightly lower in this method than 
before. But coding and decoding time is dramatically reduced. 



Original image Reconstructed image 


original image reconstructed image 


Figure 6. Output from a fractal coding method and multiple 
tree algorithms with a running time algorithm of 1 10 seconds 

Running time algorithm take 110 seconds which is slightly 
more running time and for the on-line cases can not be a good 
algorithm. As well as, the of PSNR values is obtained 33 by 
this method. 

In the next stage, it was used Kohonen self-organizing 
neural networks in purposing of reducing of coding and 
decoding time. The results showed that the algorithm can also 
reduce the time and also maintain image quality. 



Original image Reconstructed image 


Figure 7. The output of SOM- method and vector 
quantization based clustering 

Running time algorithm takes 30 seconds which shows 
decreased compared with the previous two method. As well as, 
the of PSNR values is obtained 24 by this method. 

PSNR value is slightly lower in this method than before, 
but the time of coding and decoding, has been reduced. 


Running time algorithm take 220 seconds which is slightly 
more running time and for the on-line cases can not be a good 
algorithm. As well as, the of PSNR values is obtained 33 by 
this method. 
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Original image Reconstructed image 


Figure 8. The output of SOM- method and vector 
quantization based clustering with a running time algorithm of 
120 seconds 

Running time algorithm take 120 seconds which is slightly 
more running time and for the on-line cases can not be a good 
algorithm. As well as, the of PSNR values is obtained 33.16 by 
this method. The results showed which coding time is reduced 
no significant decrease in the quality of the reconstructed 
image. The following table show comparison of the methods 
implemented in this paper. 


Table 1. Compares results between techniques 
implemented 



coding time 

PSNR 

Fractal method 

1046 

27 

540 

34.18 

240 

34.18 

Fractal and 
Multiple trees 

44 

25 

220 

33 

110 

33 

SOM- method and 
vector quantization 
based clustering 

30 

24 

120 

33.16 

83 

30.65 


IV. Conclusion 

In this study, with the aim of increasing the compression 
ratio, which makes to reduce the image size, is reduced. So 
that the image quality is not significantly different than the 
original image and also aims to reduce the time coding and 
decoding image, it used three different ways to 

implement. Finally, it was compared them with each other. It 
was observed that simple fractal coding method image 
reconstruction time is better quality, although coding and 
decoding time is very high in this method. This problem makes 
the changes to the algorithm which maintains the quality 
reconstructed image, as well as can be reduce the time. 
Multiple tree algorithms could well reduce this time. This 
algorithm could with creating of suitable book to well reduce 
speed in coding and decoding algorithms. In last stage, it was 
used Kohonen self-organizing neural networks based on 
clustering algorithm have provided a further reduction of time. 
While by obtaining a measure of PSNR be able to measure the 
quality of output image. The results showed which coding time 
is reduced no significant decrease in the quality of the 
reconstructed image. 
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Abstract — IEEE 802.15.4 standard is widely adapted for Body 
Area Sensor Networks (BANs) due to its low duty cycle and low 
power operation. However, IEEE 802.15.4 recommends the use of 
fixed duty cycle operation which results in high energy 
consumption and end-to-end delay. Therefore, an efficient 
algorithm is needed to adapt duty cycle operation to overcome the 
end-to-end delay and energy consumption. In this paper, we 
propose a Joint Duty Cycle algorithm (JDCA) for the BAN to 
enhance the network lifetime, throughput and decrease the end- 
to-end delay. Dynamic duty cycle can be adapted by the two MAC 
parameters: Beacon Order (BO) and Super frame Order (SO). 
However, these parameters are set by the network administrator 
before the network deployment. During simulation, JDCA 
algorithm is capable of adapting dynamic duty cycle at run time 
based on traffic load. Furthermore, simulation results shows 
enhanced network lifetime, network throughput and less end-to- 
end delay when compared with IEEE 802.15.4. 

Index Terms — Dynamic duty cycle, IEEE 802.15.4, Body area 
sensor networks, Wireless personal area network. 


I. Introduction 

T he IEEE 802.15.4 [1] is a wireless technology widely 
adapted for low rate Wireless Personal Area Networks 
(WPANs). IEEE 802.15.4 based application ranges from: 
home automation, industrial automation, intrusion detection, 
agriculture monitoring and body area sensor networks. 

Fig 1 shows a WBAN infrastructure for medical and non- 
medical applications. The nature of data reported by BAN 
devices can be on-demand, emergency and periodic. On- 
demand traffic is used by the doctors for the purpose of 
diagnostic; this traffic is normally requested by the doctor or 
observer of the patient. This is further divided into Periodic data 
traffic and continuous data traffic. Periodic traffic can be sent 
by the sensor after some interval of time, possibly when a node 
have some data to send (when information is required 
occasionally), Continuous traffic is sent by the sensor node 
continuously; such as Continuous Glucose Monitoring. 
Emergency traffic is totally unpredictable and is initiated by the 
source nodes when exceeds from a threshold value. Whereas, 
the normal traffic is generated by source nodes in a routine 
condition. PAN coordinator collects the normal routine data of 
a patient and proceed for further processing with no time critical 
and time bound requirements. The PAN coordinator is further 
connected to telemedicine, server for relevant 


recommendations. At the end the data is given to the care giver 
to monitor the health relevant conditions and then treated 
accordingly as per the need. 



In IEEE 802.15.4, the central device (PAN coordinator) 
determines the duty cycle of nodes in a star topology. The PAN 
coordinator transmits BO within the beacon frame. The source 
nodes send data in the active period of beacon interval when 
they have some data otherwise stay in low power mode. In order 
to achieve greater performance and synchronization, IEEE 
802.15.4 uses super- frame structure for the adjustment of duty 
cycle. The duty cycle of WPAN can be adjusted by BO and SO 
according to nature of traffic and network conditions. However, 
IEEE 802.15.4 does not propose any algorithm that can 
dynamically adjust the BO and SO in the super-frame structure 
to meet the WBAN application requirements. 

The rest of paper is organized as follows: 802.15.4 Overview 
is discussed in Section 2. Section 3 outlines existing research 
efforts on BO and SO dynamic scheduling in WBANs. JDCA 
algorithm is discussed in section 4. Section 5 describes the 
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results and critical analysis of JDCA and IEEE 802.15.4, 
whereas, the last section concludes this paper. 

II. IEEE 802.15.4 Overview 

This section discusses the beacon enabled mode of IEEE 
802.15.4. IEEE 802.15.4 is a protocol which support Internet of 
Things (IoT) based applications widely [3]. 

IEEE 802.15.4 is being widely adapted communication 
protocol for Internet of Things (IoT) based applications. These 
application ranges from smart university, home automation, and 
industrial automation, agriculture monitoring weather 
monitoring and smart tracking of objects [4]. To form an IEEE 
802.15.4 based communication network there are two types of 
network devices: Fully Functional Devices (FFDs) and 
Reduced Functional Devices (RFDs). FFDs have more 
capability in terms of energy, topology maintenance, and 
communication control and for network organization. Due to 
their functionality FFDs are also called Personal Area Network 
(PAN) controllers. Whereas, RFDs devices has less 
functionality as compared to the FFDs. RFDs can only 
communicate with the FFDs. These devices together can form 
star, peer-to-peer or tree topology shown in figure 2. In star 
topology, the end devices are connected to the one main 
controller, usually called “PAN coordinator”. The PAN 
coordinator usually transmits beacons to the connected devices 
to send data to the PAN coordinator and synchronization. 

IEEE 802.15.4 is operated in two modes of communication: 
Beaconless and Beacon-enabled mode. Beaconless mode is 
used where we do not need any energy constraints. On the other 
hand, beacon-enabled mode is used in networks where energy, 
throughput and delay are the primary concerns. Figure 3 
represents the detailed overview of the two modes of 
communication based on IEEE 802.15.4. A super- frame 
structure is used by the PAN coordinator in a beacon-enabled 
mode for the purpose of data communication. Super frame 
structure is bounded by beacons as shown in figure 6.3. The 
super frame structure is divided into two main portions: active 
and inactive portions. The Inactive portion is the time period 
where nodes and PAN coordinators stay in sleep mode which is 
usually called the low power mode. Active portion is further 
divided into two portions i.e. Contention Access Period (CAP) 
and Contention Free Period (CFP). CAP is the time duration 
which is divided into 1 6 time slots of equal length. A single slot 
duration is equal to 60 symbols. CAP portion is depicted by the 
super frame duration (SD). SD depends on the value of the 
super frame order (SO) while beacon order is identified by the 
Beacon Interval (BI). Whereas, CFP is an optional mode of 
communication, which uses guaranteed Time Slots (GTSs). 
GTS is used for those application which requires the guaranteed 
bandwidth for specific period of time. 



(Q) PAN Coordinator 
© Full Function Device (FFD) 

© Reduced Function Device (RFD) 



Fig 2. IEEE 802. 1 5.4 based topologies 



Fig 3. Modes of operation 


Beacon Beacon 
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The BI, SD, DC and sleep period can be calculated by the 
below equations. 

aB as e Super Frame Duration 

= aBaseSlotDuration 
x aNumSuperSlot (1) 

BI = aB as e Super Frame Duration 

x 2 50 (0 < BO < 14) (2) 

SD = aB as e Super Frame Duration 

x 2 b0 (0 <BO < 14) (3) 

Sleep Period = BI — SD (4) 

SD 

Duty Cycle = DC = — (5) 

BI 

III. Related Work 

This section briefly surveys the existing dynamic BO and SO 
adjustment schemes in the area of body area sensor networks. 
In existing literature, few efforts are focused at improving the 
performance of IEEE 802.15.4 based beacon-enabled networks 
by altering only one of the MAC layer parameters, either BO or 
SO or both simultaneously. 

The Beacon Order Adaptation Algorithm (BOAA) [5] and 
Individual Beacon Order Adaption Algorithm (IBOAA) [6] use 
BO adjustment for power saving purposes. These schemes 
increase the sleep mode of nodes by increasing BI during idle 
network state. However, by fixing SO at a smaller value, the 
overall throughput is decreased and latency of information 
delivery is increased. This is because of greater inactive portion 
within the super-frame as compared to active portion. 
Therefore, BOAA [5] and IBOAA [6] does not perform 
efficiently when the traffic nature is unpredictable. 

The Duty Cycle Algorithm (DCA) [7] adjusts duty cycle by 
fixing BO, so it does not meet real time data requirements. The 
Dynamic Super-frame Adjustment Algorithm (DSAA) [8] 
alters only SO parameter, with the primary goal of energy 
consumption. In [8], the active duration within super- frame is 
changed when application requirements are not meet by current 
SO. The PAN coordinator dynamically adjusts the duty cycle 
by adapting SO whereas, the BO is kept fixed, as a result packet 
delivery delay is increased. In 9], the coordinator detect 
congestion by increased contention during the active period and 
control congestion by dynamically setting SO according to 
current network conditions. Although, congestion is controlled 
but the BO and SO values are not considered for decreasing 
duty cycle. Also, this solution [9] increases the delay of 
information delivery under certain network conditions. 

In [10], the Markov-base theoretical analysis is proposed for 
energy saving and to meet the application requirements. 
However, this model is designed by considering fixed network 
traffic patterns and require fine tuning of different variables. 
This model [10] is not validated using simulations or test beds. 

Duty Cycle Self- Adaptation Algorithm [11] modifies the 
duty cycle based on adjusting both BO and SO parameters in 


IEEE 802.15.4. In the DBSAA algorithm [1 1], the CAP portion 
is adjusted in a star topology under beacon enabled mode 
operation. DBSAA is based on the findings presented in [7]. 
DBSAA algorithm is based on these three steps; firstly, 
DBSAA estimates network load, secondly it determines 
changes in the network load, thirdly calculates a factor a which 
assigns the number of BI that the coordinator should wait before 
applying the DBSAA Algorithm. 

Load Adaptive MAC protocol (LA-MAC) [12] proposed for 
the body area sensor network based on IEEE 802.15.4 by 
considering these parameters: traffic load, number of source 
members participating in communication and network delay. 
This primary goal of this protocol to adjust the BO and SO in a 
star topology to enhance the network life time of the network. 
The algorithm adapts the BO and SO at run time based on the 
above parameters. LA-MAC [12] shows enhanced network 
performance in terms of energy consumption, end-to-end delay 
and network throughput. 

In [13, 14], the algorithm goals to adjust both BO and SO 
based on traffic load to enhance the network throughput. The 
algorithm works in three major steps: First, TDSA calculates 
the expected SO based on the data rate generated by source 
nodes. Secondly, the TDSA waits for three consecutive BI 
intervals to alter the SO and BO based on the network 
requirements. Finally, the network delay is checked to further 
enhance the SO value if the previous SO does not guarantee the 
maximum network throughput and network delay. However, 
this algorithm consumes more energy as it achieves its primary 
goal of network throughput by adjusting both the BO and SO 
simultaneously. 

IV. Joint Duty Cycle and Optimal Energy Adaptation 
Algorithm 

This section describes the proposed algorithm called joint 
duty cycle and optimal energy adaptation algorithm (JDCA). 
The detailed working of JDCA is shown in figure 5. 

In this algorithm the PAN coordinator is responsible for the 
duty cycle and optimal energy adjustment and values of these 
parameters are calculated after every BI. When the nodes in the 
network are not communicating and network is in idle state, 
default values of BO and SO are used (BODefauit = 4 and SOoefauit 
= 2). Smaller values of SO as compared to BO is used to 
decrease the energy consumption during the idle network time. 
The detailed working of algorithm is explained in the rest of this 
section. 

Initially the SO Next is determined for the next BI by the 
following equation: 

SO Next = SN x NumPkt x NumPktSize 

X PktTransmissiontime (6) 

In the above equation (6) the SN represents the Number of 
source nodes. NumPkt is the number of packets that a single 
source node can generate. NumPktSize is the size of the single 
packet PktTransmissiontime is the time is the time required for a 
single packet delay from the source node to destination node. 
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Fig 5. Joint duty cycle and optimal energy adaptation algorithm 


Then the SO Next is checked whether it is equal to zero or not, 
if it is equals to zero it means there is no network activity. Thus, 
the communication is carried out with a default SO = 2 to 


TABLE I 

Simulation Setup 


Parameter 

Description 

Simulation Time 

900 seconds 

Range 

15 meter 

Frequency 

250 kbps 

JDCA 

DODefault — 6 
SOoefault — 2 

IEEE 802.15.4 

BO = 6 

SO = 2, 3, 4 and 5 

Traffic Type 

CBR 

Packet Size 

60 Byte 

Number of source nodes 

5 

Data Rate 

100 and 150 kbps 

Transmit mode 

12.3 mA 

Receive mode 

14 mA 

Sleep mode 

0.02 jliA 

Idle mode 

0.4 mA 


conserve less energy whereas, BO is decremented by 1 to 
decrease the length of BI for the next interval to avoid 
disassociation of nodes during no network activity. 

On the other hand, when SO Next is not equals to zero, then it 
checks the network load. A suitable SO Next is chosen based on 
the data traffic and number of source nodes. 

Furthermore, the receive ratio is checked with a predefined 
threshold. Receive ratio is computed based on the following 
equation: 


PAN 


RecvRatio 


/ NumPktTotal \ 
\SN x NumPkt) 


x 100 (7) 


In the equation (8) NumPktTotal is the total number of 
received packets at PAN coordinator. 

In case, the PANR eC vRatio is greater than the threshold, then 
SONext becomes the SOc ur and BOp re vious becomes the next BO 
for communication. It is because the network requirements are 
met and both BO and SO does not need to be altered. However, 
when P AN RecvRatio is less than the defined threshold value, SOcur 
and BO is incremented by 1 respectively to increase the super- 
frame duration for the next interval to meet the network 
requirements. 

V. Results and Discussions 

In this section, the detailed performance of proposed 
algorithm is evaluated against IEEE 802.15.4. Comparisons are 
based on packet delivery ratio, throughput, energy consumption 
and end-to-end latency. The simulation analysis is performed 
using network simulator NS-2 [15]. 


L U J 



Fig 6. Simulation setup 

The network is configured based on star topology and nodes 
communicate with PAN coordinator in a beacon enabled mode. 
The simulation setup is shown in figure 6. There are 5 nodes 
randomly positioned in 15 meter radio range with PAN 
coordinator placed in the middle. To have more realistic and 
actual results we used the ATMEL mote [16] for node’s energy 
model. The rest of simulation setting are shown in table 1 . 

Throughput of JDCA and IEEE 802.15.4 is shown in figure 
7. In figure 7 the overall data rate is 100kbps for both JDCA 
and IEEE 802.15.4. It is observed that IEEE 802.15.4 performs 
the same at IFQ length = 10, 20, 30, 40 and 50 but does not 
reach the performance of JDCA. It is because, the IEEE 
802.15.4 uses fixed BO and SO which results in degraded 
network performance. However, JDCA outperform IEEE 
802.15.4 in all cases due its dynamic behavior of duty cycle 
adjustment regardless of IFQ length. 
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IFQ l ength 

Fig 7. Throughput of JDCA and IEEE 802. 1 5.4 


Figure 8 shows the average receive ratio of JDCA and IEEE 
802.15.4. It can be seen from the figure that dynamic duty 
cycling of nodes results in enhanced performance as compared 
to IEEE 802.15.4. Therefore, JDCA manages the super- frame 
structure according to the traffic and increase or decrease the 
BO and SO values at run time when receive ratio is observed 
less at PAN coordinator. 

End-to-end delay for the JDCA and IEEE 802.15.4 is shown 
in figure 9 at the PAN coordinator against interface queue 
length. A decrease in end-to-end delay is observed with the 
increase of SO from 2 to 5. Therefore, as the super- frame 
duration is increased the packet delivery of packets are 
increased as a result decrease in end-to-end latency is observed. 
However, when the results of IEEE 802.15.4 is compared with 
JDCA, it outperforms IEEE 802.15.4 because of its dynamic 
behavior. 
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Fig 8. Receive ratio of JDCA and IEEE 802.15.4 


shifts BO and SO to its default values to decrease the energy 
consumption at maximum level. 



Interface queue lerngin 


■■ JDCA (default) 

M IEEE 002.15.4 at <BO=6: SO=2) 
E5551 IEEE B02.15.4 at (BO=B: SOB ) 
I— I IEEE BQ2.15.4 at (BQ=S; SO=4) 
HUM IEEE B02 T5 4 at (BOB; SOS) 


Fig 9. End-to-end delay of JDCA and IEEE 802. 1 5.4 
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Fig 10. Per bit power depletion of JDCA and IEEE 802. 15.4 
VI. Conclusion 

In this research paper, we proposed an algorithm for the body 
area sensor network which adapts the duty cycle according to 
the network requirements to decrease the energy consumption, 
and end-to-end latency of communication and to enhance the 
network throughput. PAN coordinator is responsible to 
compute super- frame duration for the next BI based on traffic 
load. During no network activity JDCA changes the BO and SO 
to its default values to conserve less energy. Finally, results 
show that JDCA performs better than IEEE 802.15.4. In future, 
JDCA algorithm will be implemented in a multi-hop tree 
topology. 
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Abstract — This paper presents the evaluation performance of 
broadband hybrid satellite constellation communication system 
(BHSCCS) networks which provides high performance data 
transfer in grid network environment based on TCP protocols. 
The evaluated hybrid satellite network uses the 
COMMStellation™ constellation topology on lower orbital. We 
adopt the GridFTP to improve network performance. GridFTP 
is a high-performance, reliable data transfer protocol optimized 
for high-speed Internet to suitable WAN networks. The 
simulation results show the network performance of GridFTP 
which different AQMs, TCPs, PERs, over BHSCCS networks. 

Keywords: COMMStellation ™; GridFTP; Hybrid Satellite; 
Queue; TCP 

I. Introduction 

Satellite is the only choice for long-haul wireless broadband 
global area network communication. Some countries with 
large and/or island geographic areas and/or hard-to-reach areas, 
such as China, India, Indonesia, are likely to benefit the 
greatest from satellite broadband. There are a huge big data 
internet accesses by users. [1] Ever-increasing demands in 
high-end computing and intensive data exchange, together with 
the cost effectiveness of high performance commodity systems, 
have led to massive deployment of compute and storage 
systems on a global scale. In such Grid-based scenarios, bulk 
data transfer within and across physically separate clusters or 
data centers has become an inescapable requirement for 
scientific data set distribution, content replication, remote data 
backup, and so forth. Generally, File Transfer Protocol (FTP) 
is used for handling bulk data transfers. Since the earliest FTP 
implementation based on Transport Control Protocol / Internet 
Protocol (TCP/IP), many efforts have been undertaken to 
improve and extend it. GridFTP, designed specifically for 
high-bandwidth wide area networks, has appeared as the most 
popular FTP implementation in the Grid environment. 

On the other hand, high-performance interconnections such 
as InfiniBand (IB) and Gigabit Ethernet/iWARP are rapidly 
gaining momentum for designing high-end clusters and data 
centers. In addition to providing high bandwidth and low 
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latency, they provide advanced features, such as zero-copy 
communication and Remote Direct Memory Access (RDMA), 
which enable the design of novel communication protocols and 
libraries. The recent introduction of IB WAN (Wide Area 
Network) router allows us to extend these capabilities across 
multiple campuses or even across WAN distances. 

Hence, it is the most challenging to implement high 
performance data transfers in the Grid environment over 
broadband hybrid satellite constellation communication 
system. For the future generations of the internet, broadband 
hybrid satellite access and QoS are surrounded by the most 
significant issues to be solved. 

II. Background and related works 

In coherence to this research, we have implemented 
GridFTP, broadband hybrid satellite network, TCP over 
satellite, and queue management to analyze the performance 
evaluation based on the Broadband Hybrid Satellite 
Constellation Communication System (BHSCCS) topology 
model. 

A. GridFTP 

GridFTP [1] is a protocol extension FTP and its existing 
extensions are used to manage large data transfers across 
computational grids. GridFTP is a high-performance, secure, 
reliable extension of the standard FTP data transfer protocol, 
optimized for high-bandwidth, wide area networks. The 
Globus implementation of GridFTP provides a software suite 
optimized for a broad range of data access applications, 
including bulk file transfer and data extraction from complex 
storage systems. The following are some of the key advantages 
of GridFTP which are a) Performance: enhanced performance 
through use of parallel streams and coordinated data transfers 
b) Security: PKI/X.509 and SSH-based Grid security c) 
Robustness: restart markers allowing interrupted transfers to 
restart with minimal delay overhead d) Extensibility: easy-to- 
use Open/Close/Read/Write (OCRW) interface to users and 
applications. 
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GridFTP relies on Transport Control Protocol (TCP) [2], 
the most widely used TCP for the Internet as well as 
computational grids. Such grids are often characterized by 
networks with large bandwidth-delay products (BDP). 
Currently, however, the default flow-control parameters in TCP 
are statically tuned to suit networks with small BDPs, and thus 
perform abysmally over today’s grid networks with large 
BDPs. 

[2] Tuning of buffer sizes is necessary for maximizing 
throughput with TCP. For any given connection, the optimal 
TCP buffer size is equal to the BDP of the connection. Often, 
grid researchers manually tune the buffer sizes to keep the 
network pipe full by using diagnostic tools to determine the 
round-trip time (RTT) and bandwidth of the connections. 

In addition of GridFTP, to simulate bulk data transfer the 
three major parameters defined for the GridFTP simulator are 
Bandwidth, Parallel, and Ratio. The Bandwidth is used to set 
the total bandwidth of the satellite link. The Parallel is used to 
set the parallel GridFTP steams. It is set to 4. The Ratio is 
used to set the throughput ratio among the parallel streams. By 
default, it is set to 1 : 1 : 1 : 1 , which means that each GridFTP 
stream will transmit packets at an equal speed. 

B. Broadband Hybrid Satellite Network 

Broadband satellite systems have been an important 
component of telecommunications networks for many years 
serving, in particular, long distance telephony, data, and 
television broadcasting. The involvement of broadband 
satellite in Internet protocol (IP) networks is a direct result of 
new trends in a global telecommunications, where Internet 
traffic will hold a dominant share in the total network traffic. 
The large geographical coverage of the satellite footprint and 
its unique broadcasting capabilities, as well as its high-capacity 
channel combined with readily available Ka-band spectrum 
will retain broadband satellite systems as an irreplaceable part 
of communications systems. 

The COMMStellation™ [3] is adopting for implementation 
in this research. COMMStellation™ is a high-power satellite, 
with the latest in advanced technology, built by Microsat 
Systems Canada Inc. (MSCI). The COMMStellation™ group 
deploy and operate a constellation of 72 satellites plus spares 
divided into six polar, low earth orbit (LEO) orbital planes at 
an altitude of 1000 km designed to provide high-speed, global 
business communications. This is a typical Walker 
constellation, providing worldwide coverage assuming ground 
contacts. The bandwidth of satellite is 8.8 Gigabits, uplink, and 
downlink are 1.1 Gigabits. The key parameter concept for 
implementation into network simulator is [4]. The 
COMMStellation™ will be capable of providing high-speed 
data communications for many applications. Some potential 
uses of this system for communications include a) Strategic 
communications for government, military, or corporations b) 
Special purpose low latency communications, and c) High 
bandwidth burst data transmission from remote sensors, ships, 
exploration vessels, sites. 


C. TCP over Satellite 

Transmission Control Protocol (TCP) is a transport protocol 
used by many Internet applications for end-to-end reliable data 
delivery. TCP performs well in terrestrial networks. Given the 
widespread use of TCP based applications and interconnection 
with the terrestrial Internet, it is likely that broadband satellite 
networks will transport large amounts of traffic generated by 
TCP’s algorithms. The desire to utilize satellite capacity 
efficiently has led to the development of number of optimized 
for satellite TCP variants, with altered transmission control 
behavior. They are run between ground terminals across the 
satellite link, while other TCP implementation are across the 
terrestrial Internet [5], [6], [7]. 

TCP Tahoe [8] being the first implementation of TCP to 
involve a few new algorithms in early TCP implementations 
likes Slow Start, Congestion Avoidance, and Fast Retransmit. 

TCP New Reno [8] includes a change to the Reno 
algorithm at the sender end with a view to eliminate Reno’s 
wait for a retransmit time-out whenever multiple packets are 
lost from a window. This change modifies the sender’s 
behavior during fast recovery. 

TCP Westwood [9] is a sender-side-only modification to 
TCP New Reno that is intended to better handle large 
bandwidth-delay product paths, with potential packet loss due 
to transmission or other errors, and with dynamic load. 



Figure 1 . The network model 

In Figure 1 is shows the TCP flow data transfer packets into 
client. The network consists of queue model and error model 
for realistic situation. We are used this figure to implement our 
simulation in broadband hybrid satellite network model. 

D. Queue Management 

Queue management in routers is an important role in taking 
care of congestion network traffics. Two procedures are 
adopted to solve the problem. Firstly, is congestion avoidance 
preventive technique, which comes into play before traffic in 
network is congested by overloading. Lastly is congestion 
control, which comes into play after the congestion at a traffic 
network has appeared and the traffic network is overloaded. 

DropTail, it is implements FIFO (First In, First Out) 
scheduling and drop-on-overflow buffer management, which 
passive queue management. According to passive queue 
management, packets are dropped only when the buffer is full. 

RED [10] is the most well know queue algorithm. It is 
Random Early Detection (RED) gateway. Stochastic drops 
allow RED to avoid global synchronization and unfairness 
against burst traffic when used in conjunction with TCP based 
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flows managed. This moderation maintains a moving average 
of the queue length to manage congestion. If this moving 
average of the queue length impostures between a minimum 
threshold value and a maximum value, then the packet is either 
marked or dropped with a probability. But if the moving 
average of the queue length is greater than or equal to the 
maximum threshold then the packet is dropped. Even though, 
it tries to keep away from global synchronization and has the 
ability to accommodate transient bursts, in order to be efficient 
RED must have sufficient buffer spaces and must be correctly 
parameterized. 

BLUE [11] has been shown to perform significantly better 
in terms of packets loss rates and buffer size requirements in 
the network. If buffer overflow causes the queue to recurrently 
drop packets, BLUE increments the making probability, thus 
augmenting the rate at which congestion notification is sent 
back. In addition detail, BLUE maintains a single probability, 
which it uses to mark or drop packets when they are queued. If 
the queue is continually dropping packets due to buffer 
overflow, BLUE increments the marking probability, thus 
increasing the rate at which it sends back congestion 
notification. Conversely, if the queue becomes empty or if the 
link is idle, BLUE decreases its marking probability. 

III. Traffic scenarios 

Traffic analysis and modeling are integral parts of 
engineering broadband telecommunications network, including 
broadband hybrid satellite network systems. The analysis is 
related with getting statistical properties of the traffic and 
systematical solutions for the evaluating performance of 
networks, in terms of different measures such as packet delay. 
We have also established the presence of self-similarity in 
many scenarios, which is important for predicting network 
behavior. 

The complexity and dynamics of hybrid satellite networks 
prevent performance evaluation of GridFTP algorithm using 
high performance data transfer analytic expression. For testing 
and analyzing various different packet error rates, various 
different TCPs protocol, and various different queues 
algorithm, we have, therefore, developed the GridFTP over 
broadband hybrid satellite network simulation schematically 
illustrated in Figure 2. In our experiments, four user station 
links in different areas around Asia are chosen to observe 
network traffic. The packet size of GridFTP is 1514 bytes 
excluding the header size. We run simulation for 100 seconds 
for all scenarios, and have a full run of GridFTP with 
maximum user stations both of source and destination. 

As for the simulation based on the grid traffic model, we 
are able to build simulator using NS -2 to accurately replicate 
the characteristics of the applications we are considering. This 
simulator is useful in validating our network traffic models and 
is ultimately released publicly so interested parties are able to 
evaluate different types of grid traffic in BHSCCS [4] 
networks. 



Figure 2. The network infrastructure for global area bulk data transfer 

IV. Results and discussion 

Simulations have been used to compare the GridFTP 
performance of different services employing Transport Control 
Protocol (TCPs) such as TCP Tahoe, TCP New Reno, TCP 
Westwood, Active Queue Management (AQMs) such as 
DropTail, RED, BLUE, Packet Error Rate (PERs) such as 1 - 
10 percentiles over BHSCCS system. 

Throughput is the main performance measure characteristic, 
and most widely used. This measures how soon the receiver is 
able to get a certain amount of data sent by the sender. It is 
determined as the ratio of the total data received to the end to 
end delay. Throughput is sometimes different from goodput, 
because goodput consists solely of useful transmitted traffic, 
where as a throughput may also include retransmitted traffic. 
Throughput is an important factor which directly impacts the 
network performance. 

End-to-End delay is the time elapsed while a packet travels 
from source to destination. The larger a value of delay, the 
more difficult it is for transport layer protocols to maintain high 
band widths. This characteristic can be specified in a number of 
different ways, including average delay, variance of delay 
(jitter), and delay bound. 

The research has found out that a principal reason for such 
high expectations from user station is the amount of throughput 
in simulation time. The total simulation time of the research is 
100 seconds. The simulations are done, the throughput results 
are shown in Figure 3, 4, and 5 (DropTail, RED, BLUE) 
respectively; in these figures comparison of the TCP variants’ 
performance. Each TCP variant has its own advantages or 
disadvantages. But in this research, we will focus on the most 
balanced of GridFTP for network performance tradeoffs in the 
BHSCSS scenario. 

The experiment shows the comparison of throughput in 
different AQMs, TCPs, and PERs schemes over time. The 
TCP scheme results show an average performance of 
throughput. The throughputs in Figure 3 TCP Westwood case 
on DropTail queue are the highest TCP New Reno and TCP 
Tahoe in any PERs. But, the average throughput performance 
in RED queue in Figure 4 is the lowest in all dimensions when 
compared with DropTail queue and BLUE queue in Figure 5. 
Thus, the results from the analysis of GridFTP application in 
this simulation is highest average throughput in BLUE queue 
and TCP Westwood all PERs. 
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t— 0 rj,, , , Figure 6. Average end-to-end delay of GridFTP with Droptail queue 

Figure 3. Throughput or GridFTP with Droptail queue & & j r ^ 


GridFTP with Red queue over Hybrid Satellite 



Figure 4. Throughput of GridFTP with RED queue 


GridFTP with RED queue over Hybrid Satellite 



Figure 7. Average end-to-end delay of GridFTP with RED queue 


GridFTP with Blue queue over Hybrid Satellite 



Figure 5. Throughput of GridFTP with BLUE queue 


GridFTP with BLUE queue over Hybrid Satellite 



Figure 7. Average end-to-end delay of GridFTP with BLUE queue 
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The research of highlighted the fact that average end-to-end 
delay time owners must be aware, and must implement, fair 
usage policies in order to prevent bandwidth from being 
depleted to use bandwidth more productively. Figure 6, 7, 8 
are show end-to-end packet delay in different AQMs and 
PERs. 

The total simulation time of this research is 100 seconds. It 
has 3 TCP flows which start at the beginning of the simulation. 
A comparison of the TCP variant’s performance with different 
AQMs, and PERs, illustrated Figures 6, 7, and 8. 

The end-to-end delay will be reflexive with the throughput; 
its mean being higher throughput equal higher end-to-end 
packet either. In illustrated below shows an average end-to-end 
packet delay in different TCPs and PERs over DropTail, RED, 
and BLUE queue respectively. The graph shows TCP 
Westwood has higher end-to-end packet time than others, 
reflecting the highest throughput in the previous graph. The 
figures show the network with respect to different PERs. The 
results are analyzed and a comparison on graphs all the 
transmitted packets being received successfully, the throughput 
and end-to-end delay were much affected by the rapid change 
in queuing of the packets during the fast data transfer. The 
GridFTP transmission of packet has a positive impact on the 
throughput but a negative impact on the delay. 

In addition, an average end-to-end delay is calculated from 
median point in increasing the errors in scenario. We have 
counted the total packets that arrive in destination and 
successfully transfer bulk data. And, we used same method in 
the throughput for the median point, too. 

V. Conclusion 

In this research, we have simulated the grid environment 
using high speed data transfer over broadband hybrid satellite 
constellation communication system by us network simulation 
tool. As shown in the results, the AQMs, TCPs, and PERs has 
an obvious impact on the network performance for high 
performance data transfer in grid environment. The GridFTP 
load has been implemented for traffic network load of grid 
network to investigate efficient ways of dynamically changing 
on broadband hybrid satellite network communications 
scenario. In addition, we study the behavior grid environment 
in a hybrid satellite network to prove the impact quality of 
services on a network performance. 

Finally, it is apparent that evaluated the network 
performance of the GridFTP with active queue management 
BLUE queue, and TCP Westwood over hybrid satellite results 
give an expression of the most viewpoint of higher 
performance data transfer on broadband hybrid satellite 
network systems. 
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Abstract — Wavelet Neural Network (WNN) is attracting interest 
in field of classification system, because they are universal 
approximations, particularly due to rapid and accurate 
representation of nonlinear dynamic systems. The satisfying 
performance of the WNN depends on an appropriate 
determination of the Wavelet Neural Network structure. 

In this paper we provide a new method to solve this problem 
based on the Least Absolute Shrinkage and Selection Operator 
(LASSO). At first, the scale of WNN is managed by using the 
time-frequency locality of wavelet. Furthermore, the 
unconstrained optimization problem (LASSO) is used to solve the 
structure and learning of the WNN. This optimization problem 
can be solved efficiently using the iteratively reweighted least 
squares (IRLS) and the Least Trimmed Square (LTS) methods to 
enhance the ineffectiveness; they are applied to train the wavelet 
neural network. The advantage of the method lies in the oracle 
properly of the LASSO can guarantee the optimal structure of 
the WNN. The proposed method has been able to optimize the 
wavelet neural network and this method is able to classify the 
DNA sequences. Our goal is to construct predictive models that 
are highly accurate. In fact, the proposed method permits to 
avoid the complex problem of form and structure in different 
clusters of organisms. The empirical results and their 
classification performances are compared with other methods. 

We compared the WNN-Lasso model with the other five 
alignment-free models, i.e., k-tuple, DMK, TSM, AMI, and CV, 
on several large-scale DNA datasets on the DNA classifying 
application by means of the K-means method. 

The experimental results have shown that the WNN-Lasso model 
outperformed the other models in terms of both the classifying 
results and the running time. 

Evenly, in this study, we present our approach consists of three 
phases. The first one, which is called transformation, is composed 
of two sub steps; binary codification of the DNA sequences and 
the Signal Processing of the DNA sequences. The second phase 
step is the approximation; it is empowered by the use of the Multi 
Library Wavelet Neural Networks (ML WNN). Finally, the third 
section, which is the classification of the DNA sequences, is 
realized by applying the algorithm of k-means classification. 

Index Terms — LASSO, LTS, Wavelet Neural Networks, DNA 
sequences, ML WNN, IRLS. 
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I. Introduction 

r J/he Wavelet Neural Network (WNN) has recently attracted 

extensive attention for its ability to identify effetely nonlinear 
dynamic systems with incoherent information [1-2-3-4-5]. The 
WNN were introduced by Zhang and Benveniste [1-2-3] in 
1992 as combination of the artificial neural network wavelet 
decomposition. The generalization performance of the WNN 
trained by least- square approach deteriorates when outliers are 
present. This training approach involves estimating parameters 
in the wavelet network to minimize some function costs, a 
measure reflecting the approximation quality performed by the 
wavelet network over the parameter space in the network. 
However, the studies of the WNN have often concentrated on 
a small dimension [6] . The reason is that the complexity of the 
Wavelet Neural Network structure will increase exponentially 
with the input size, i.e the curse of dimensionality to improve 
the performance of the WNN in high dimension application. 
Feed forward neural network such as multilayer perceptrons 
(MLP) and the radial basis function networks (RBFN) have 
been used as an alternative approach to function 
approximation. 

The network structure has been studied by several researchers. 
The research effort has been made to deal with this problem 
over the last decades [6-7-8-9]. The method, referred as 
Matching Pursuit (MP), was first introduced by Mallat [10]. 
The Residual Based Selection (RBS) algorithm is used to 
select the wavelet function for constructing the WNN[10], the 
Orthogonalized Residual Based Selection (ORBS) proposed in 
[6] and the Orthogonal Least Square (OLS) suggested in[8] are 
both the popular approaches. 

The Wavelet Neural Network (WNN) is attracting interest in 
field of classification system. Such as classification of the 
DNA sequences using artificial neural networks [11]. The 
neural networks have several unique features and advantages. 
They can build non-linear decision boundaries between the 
different groups in a non-parametric fashion, and thereby offer 
a practical approach for solving highly complex pattern 
classification problems. 

Classification of the DNA Sequences using the Wavelet 
Variance and the Self-Organizing Map with an Application to 
Mitochondrial DNA[12]. This approach combines the wavelet 


280 


https://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



International Journal of Computer Science and Information Security (IJCSIS), 
Vol. 14, No. 6, June 2016 


analysis and the self-organizing map algorithm. The Wavelets 
are applied to select variation across various scales in the short 
sequence of nucleotides patterns of the DNA sequence. The 
variation is computed by the estimated wavelet variance, 
which yields a feature vector, which obtained from many 
genomic sequences, possibly of different dimensions, is then 
clustered with a nonparametric self-organizing map scheme. 
The classification of two types of the DNA sequence studied 
20 sample artificial DNA sequence whose types have been 
known are given to cluster the types of other DNA sequences. 
The Wavelet analysis is applied to choose the features of the 
sample DNA sequences [13-31]. 

The Wavelet analysis of frequency chaos game signal has been 
used to classify DNA sequences. The results creating from the 
complex Morlet wavelet analysis of the frequency chaos game 
signals have shown its accuracy in detection of the DNA 
structures. Additional, this could provide in discovering 
unknown domains with potential biological significance in 
genomes [14]. 

A neural network classification method [15] has been 
developed as an alternative method to the search problem of 
large molecular databases of the DNA sequence. Two artificial 
neural systems have been implemented on a Cray 
supercomputer for rapid nucleic acid sequence classifications. 
The used neural networks are three-layered, feed-forward 
neural networks that employ back-propagation learning 
algorithm. 

Since a DNA sequence can be can be converted into a 
sequence of digital signals, the feature vector can be 
constructed in time or frequency domains. Despite, most 
traditional alignment-free methods, such as k- tuple [16], 
DMK[17], TSM [18], AMI [19], and CV [20] methods build 
their feature vectors only in the time domain, i.e., they apply 
direct word sequences. 

The WFV method uses the discrete wavelet transform (DWT) 
to adaptively decompose and select features of the DNA 
sequence matching to its length rather than the DNA 
microarray data. As a result, sequences of distinct length can 
be converted into the same-sized feature vector [21]. 

In this study, our approach applies the Least Absolute 
Shrinkage and Selection Operator (LASSO) to solve the 
wavelet neural network structure. This approach is a useful 
tool to solve the shrinkage and the variable selection 
simultaneously, which was used successfully in COX model 
selection [22]. The lasso reduces the residual sum of squares 
subject to the sum of the absolute value of the parameters 
being less than a constant. 

Because the Least Absolute Shrinkage overcomes deficiency 
of traditional methods, it has been widely educated in 
statistics, employ mathematics as well as signal processing 
[ 22 ]. 

This study is structured as follows: section II introduces an 
overview of our approach. Section III is about the Beta 
wavelet function theory. This function will be applied to 


construct the Wavelet Neural Network. Section IV deals with 
the experimental results of the proposed approach used to 
classify the DNA sequences and Section V ends up with a 
conclusion and discussion. 

II. METHODS 

This paper presents a new approach of classification of the 
DNA sequences based on the wavelet network, which is 
constructed by applying the Library Wavelet Neural to 
approximate f(x) of the DNA sequence. The unconstrained 
optimization problem (LASSO) is used to solve the structure 
and the learning of the WNN. This approach is divided into 
three stages: approximation of the input signal and cluster of 
the compact signature of DNA sequences using Wavelet 
Neural Network (WNN) and the k-means algorithm. 

A. Conversion of DNA sequence to genomic signal 

The proposed cluster of species in class made according to the 
DNA sequence components, which is composed of four basic 
nucleotides, A( Adenine), G(Guanine), C(Cytosine) and 
T(Thymine), where each species is identified by its DNA 
sequence [25]. 

Linear feature extraction can be viewed as finding a set of 
vectors which represent effectively information content of an 
observation while minimizing the dimensionality. The method 
of indicator converts the data into digital signal, which can be 
used for the DNA signal spectrum indicates 1 or 0 for the 
existence or not of a specific nucleotide at the DNA sequence 
level. The binary indicator sequence is formed by replacing the 
each nucleotide with values either 1 or 0. 1 stands for presence 
and 0 for absence of a particular nucleotide in specified 
location in the DNA signal [25]. For example, if x[n] =[T T A 
T G TC ...], we obtain: x[n] =[0001 0001 1000 0001 0010 
0001 0100 . ..] 

B. Fourier Transform and Power Spectrum Signal 
Processing 

After the DNA sequence has been translated into these 
indicator sequences, they can be manipulated with 
mathematical models. The discrete Fourier Transform is used 
to each indicator sequence x(n) and a new sequence of 
complex numbers, called f (x) , is obtained: 

/(*) = TX e (n)e~ im,N ,k = 0,1,2, ...AT - 1. (1) 

n = 0 

It is easier to use with sequence Power Spectrum, rather than 
original discrete Fourier Transform. The Power Spectrum 
Se[k] for frequencies k= 0, 1,2, ..., N-l is presented as, 

M£] = |/(x)| 2 (2) 

Se [k] has been plotted (Fig. 1). 
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Fig. 1 Signal of a DNA sequence using Power Spectrum 

C. Wavelet Neural Network and Time-frequency Analysis 
The combination of the wavelet transform and the artificial 
neuron networks defines the concept of the wavelet networks. 
This network applies the wavelet functions instead of the 
traditional sigmoid function as a transfer function of each 
neuron. It is composed of three layers (an input layer, a hidden 
layer and an out layer) and it has the same structure as the 
architecture radial function. The salaries of the weighted 
outputs are added. Each neuron is connected to the other 
following layer. The Wavelet neural network (Fig. 2) is 
presented by pondering a set of wavelets dilated and translated 
from one wavelet candidate with weight values to 

approximate a given signal f. the overall response of the 
Wavelet neural network is 


A N w 

y = Y, 


i = 1 


x - bi 
ai 


N i 

+ Yjd k X k 

k = 0 


(3) 


where y is the response of the wavelet neural network, (xl, 
x2, . . . ,xNi ) is the vector of the input and Nw is the number of 
wavelets. There, it is often useful to consider, besides the 
decomposition of wavelets cleanly, that the output can have a 
component refine in relation to the variables of coefficients a k 
(k = 0, 1... Ni) (Fig.2). 

Wavelet occurs in family of function and each is presented by 
the dilation ai which controls the scaling parameter and the 
translation bi which supervises the position of a single 
function, called the mother wavelet ^(x). 

The Wavelet Neural Network can be noticed as function 
approximator which estimates an unknown function mapping: 

y = f(x) + £, (4) 


Where f is the regression function and £ is the error term. 


D. Multi Library Wavelet Neural Network(MLWNN) 

The Wavelet Neural Network (WNN) is builded by many 
methods [2,3], Zhang used two stages to construct the WNN: 
First, build a wavelet library W of discretely dilated and 
translated version of the wavelet mother function V F: 


w =\yM x ) = C(!Aa{x-bha=[ 

(5) 

Where xk is the sampled input and L is the number of 
wavelet in W. Then the best M wavelet is choosed based on 
the training data from the wavelet library W, in order to 
construct the regression: 

f M ( x )=y=Iwy 4 x )' (6) 

iel 

Where I is a M-element subset of the index set 
{1,2 and M < L. 

Secondly, the reduced the cost function: 

AO = min -£f yr'LwWM 

wiMlYl k=l ^ iGl 



Two heuristic algorithms has derived by Zhang, namely, 
stepwise selection by orthogonalization for deciding 
appropriate wavelet in the hidden units and backward 
elimination for choosing the number of the hidden units. The 
number of wavelets M, which are selected as the minimum of 
the so-called Akaike’s final prediction error criterion 
(FPE)[2,3]: 


Where ^ is the number of parameters in the estimator. 

The Wavelet Neural Network is trained by the gradient 
algorithms, while the Least Mean Squares (LMS) are used to 
reduce the mean-squared error: 

0 A X 


A w ) = -Z 

n i= i 


y-y {w) 


( 8 ) 


Where j(w) is the real output of the Wavelet Neural 
Network at the fixed weight vector w. 

The time-frequency locality property of the wavelet is 

applied to give a signal f , a candidate library W of the 
wavelet basis can be constructed [35]. 

There exist constants r such that: 


/- 

m,ne /? 


= o(e) 


(9) 


Hence, based on (10), the approximation can be realized by 
a three-layer network, as shown in Fig. 2. 
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ak 




Fig. 2: The three layer wavelet network 
Where X=(xl,x2,...,xd)E 0^ is the input vector of 
network; constant d represents the dimension; 

y 1 y are the output of network; \jf k , 1 < k < r , is 

the ith wavelet candidate in library; C k , 1 < k < r , are the 

weights of the network from the hidden layer to output layer 
and r is the number of the wavelet function basis in the 
candidate library. However, r drastically increases with the 
dimension d. The subset of should be identified by certain 
methods to approximate the signal /. 


E. Wavelet Network construction using Lasso-based Method 

A given a set of training data TN= {( x k) j{x 

which is used to adjust the weights on the WNN, and the 
output of the three layers of the wavelet neural network in 
Fig.2 can be expressed as : 




L ^ 

; ^ 

to r 

W ^ 

/ ' 
.J 2 > 

Ci 

II 

5? 

II 


V, 

Ci 


1 

; 

,^L 

3 

1 

Cr_ 


=wc 


( 10 ) 


(*) (*) 

Where y , \Jf , 1 < k < N, 1 < J < r is the output 

of the three layers of the wavelet network and the output of jth 
wavelet candidate for training data x(k) respectively. 

The task of Architecture of the wavelet neural networks is to 
select a subset from r wavelet candidates as neurons in the 
hidden layer in three layer wavelet network. This problem is 
also solved by a model selection. 

Furthermore, based on the determinate architecture, the task 
of the WNN learning is to obtain the sub- vector C. Many 
researches are used to select a small subset of the wavelet 
candidates in the library that best matches an output vector, the 


weights of subset is computed by iterative gradient-descent 
algorithm or least square method. 

The system identification application of the WNN is to 
represent the nonlinear dynamic system through as few 
neurons as possible. There is now an extensive literature 
indicating the wavelet neural network with few wavelets has 
the better generalization performance. 

We propose a new method for estimation in linear models to 
solve these two problems we present a novel approach to solve 
these task based on the Least Absolute Shrinkage and 
Selection Operator (LASSO), which reduces the residual sum 
of squares subject to the total of the absolute value of 
coefficients being less than a constant. Besides of the nature of 
this constraint it tends to create some coefficients that are 
exactly 0 and hence gives interpretable methods. Our 
simulation studies suggest that the Least Absolute Shrinkage 
and Selection Operator (LASSO) enjoys some of the 
favourable properties of both subset selection and ridge 
regression [22]. 

Based on this idea, these two problems can be transferred 
into below unconstrained optimization problem. 

The lasso is a regularization technique for simultaneous 
estimation and variable selection [22]. The lasso estimates 
are defined as 


C (Lasso) = Afm>l||C|| + — Y— ^Cj| 


( 11 ) 


Where A, is a nonnegative regularization variable. The first 
term in (11) is so-named “L0 penalty”. One appealing method 
is the L0 regularized regression which punishes the number of 
nonzero features in the model directly. 

The second term in (11) represents the measurement of the 
accuracy of model. The L0 penalty is attractive for variable 
selection because it directly punishes the number of nonzero 
weights. However, the optimization involved is nonconvex 
and discontinuous, and therefore it is very challenging to 
implement and this optimization problem is NP-hard, i.e., the 
computational time for solving this optimization is non- 
polynominal. Moreover, its solution we shall attempt to 
replace the optimization problem (11) with below convex 
relaxation (Lasso problem). 


C(Lasso) = MinAjc\\+- F-^C|| 


( 12 ) 


The solution of optimization problem (12) is equal to the 
one of optimization problem (11). Moreover, since the 
objective of (12) is an unconstrained convex function, we can 
apply standard approachs to construct a minimizer. The 
quickly algorithms of optimization problem (12) also exist. 


The architecture and learning of wavelet neural networks 
are considered as searching the position and values of nonzero 

entries from r entries in vector C. Suppose ^ is the set of 
optimal positions of nonzero entries and Q denote the 

values. At the same time, assume F = j / : C ^ 0 r is the 
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position of nonzero entries of solution of the optimization 

A 

problem (16) and C r denote the values in positions T . H. 
Zou points out the lasso have the following oracle properties 
[23]: 


• Determines the right subset model, 


' = j/ : c ; ^ 0 j 


=o 


• Has the optimal 

-( ^ ^ 

-> N 


'fN C r -C t 


estimation rate, 
where 


(“•S' ) 


^ is the covariance matrix knowing the true 
subset model. 


In our approach, we introduce wavelet-based weighted 
LASSO with different weighting schemes in the penalty term 
and discuss some screening strategies that can be used to 
wavelet-based functional linear model. We assume readers 
have certain familiarity with wavelet transform. 

The weights of the Wavelet Neural Network (WNN) are 
updating rules based on iteratively reweighted least squares 
(IRLS) algorithm which be proposed. The LTS estimator is 
used in robust linear parametric regression problem to non 
parametric LTS wavelet network for nonlinear regression 
problems. The main motivation is that the Least Trimmed 
Square (LTS) estimator usually has good robustness against 
outliers for linear parametric regression tasks. 

The residual ei at the ith ouput node is presented by 

A 

ei = y-yr ien (13 > 

The LTS estimator is used to select network weights that 
minimize the total sum of trimmed squared errors 

E,..A±±el <14) 

£ k = 1 i = 1 

Appropriate updating rules for minimizing (12), iteratively 
re weighted least squares (IRLS) algorithm is used to 
approximate the optimal weights. 

To find the parameters C = £* ) T which 

minimize the LI norm for the linear regression Lasso problem 

(12), 


MM \ c i + i 


Y-X/A:l = MM^\ c \ + l -i\y- ¥i C 


h (15) 


The IRLS algorithm at step t+1 involves the weighted linear 
least squares problem: 

, „ . „ , Vl (16) 

m HI c 1i + 2 c {m =^^%^ + ^wT\y -¥ c \ 2 =[^VyJ V of y- 

Where co (t) is the diagonal matrix of weights usually with all 

components set initially to : 


CO, 


(0) n 

CO = 1 

And updated after each iteration to: 

(O _ l 

max(^,|y ; -y/ I -C (?) |) 

Where 5 is the small value, like 0.0001. 


(17) 


Algorithm IRLS-GD 

Set tf/ 0) = 1 , e° = 1 ,t=0 

1: while £ f ^ 0 do 
t=t+l 

2: Compute C ^ via (16) 

3: £ t+l = min (£ t+1 ,C (t) ) 

4: Compute &^ } via(17) 


opt 

a i 


3 (e ) 

d opt 

a i 


N p 

= x e ( x )ys 

i = 1 


f v opt 

x - a r 


bi 


opt 


(18) 


b op ' =— t = Y e {x)yA 

1 db°r « n 

5: end while 


< x-aT 

bT 


(19) 


F. Approximation of the DNA Sequence Signal 

DNA sequence cluster is an NP-complete problem. Indeed, 
when the alignment is beyond two sequences, the task quickly 
becomes very complex because the space of comparison 
becomes very important. The recent advance of the DNA 
sequence technology has brought about a consequent number 
of the DNA sequences. We can are to analyze some million 
sequences and a first phase for this analysis is applied to 
determine there is a structure of the DNA sequence data in 
homogeneous groups according to a criterion to be identified. 

In this study, a classifier is applied to cluster the dataset of 
the DNA sequence using the Power Spectrum to process the 
signal of sequence and the application of the wavelet neural 
networks as a classification method, which solves the 
classification tasks of the DNA sequences. Initially, the 
proposed approach can bring the learning index defined by the 
wavelet network to develop a compact signature of the DNA 
sequence, which is composed by the wavelet coefficients, 
which are applied to match the DNA test with all the 
sequences in the training set. Then, for the cluster, the DNA 
test is projected onto the wavelet neural networks of the 
learning DNA sequences and new coefficients specific to this 
sequence are computed (Fig. 3). Finally, we compare the 
coefficients of the learning DNA sequences with the 
coefficients of the DNA test sequences by computing the 
Correlation Coefficient. In this step, we apply the principle of 
k-means clustering to classify the characteristics of the DNA 
sequences. 
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Fig. 3 DNA sequence classification by WNN 


To approximate f(x) of the DNA sequences, the optimal 
wavelet function is choosed to obtain signal representation 
with minimal error rate. To solve the approximation task, a 
library wavelet containing a family wavelet is applied. It is 
called Multi Library Wavelet Neural Network approach 
(ML WNN) [40,44]. In our approach, the second phase is to 
construct the library wavelet and approximate the function f(x) 
of the DNA sequence. 

G. Learning Wavelet Network Using the Lasso-LTS 
Method 

In this section, we show how we can learn the wavelet 
neural networks using the MLWNN [25-26-16-17-43-46]. 

1 . Proposed Learning Algorithm 

Step 1: The original data of the DNA sequence is divided into 
two groups: training and testing dataset. 

Step 2: Convert the DNA sequence to a genomic signal using 
a binary indicator and Power Spectrum Signal Processing 
Step 3: Construct a library W of discretely dilated and 
translated version of a given wavelet Iff , according to the 
available training data set. 

• Use the Least Trimmed Square (LTS) estimator for 

choosing the optimal mother wavelet function from 
the library wavelet (13) (14). 

• Choose, from the library, the N wavelet candidate that 

best matches an output vector. 

Step 3.1: Initialize the mother wavelet function 

library xp = {q# i = 1,2,... m , m = ||vf/ || , is the 

number of the wavelet candidates included. The LST estimator 


is applied to choose the best and the optimal wavelet 
functions wavelet Bes t= wavelet* 

Step 3.2: Randomly initialize w jk and v*j. 

Step 3.3: For k=l,...,m 

A 

a) Calculate the predicted output y . via (3). 

A 

b) Compute the residuals e ik = y.-y. in(13). 

c) If the stopping criterion is achieved, then 
stop; otherwise, go to the next step 

d) Find the ranked values e 2 ik < < £ 2 /m 

Choosing the N best mother wavelets function to 
initialize the WNN. 

Step 4: Use the translation b* and dilation a* of th N relevant 
wavelets as initial values. 

Step 5: Use IRLS-GD algorithm to 

opt opt J J opt 

compute yy.. ,q. andfr. . 

Step 6: Construct an empty matrix (Classes_signature_DNA)( 

opt opt J J opt 

Wij ’ CLi Utlaly . ) which has to contain the clusters of 
DNA sequences. 

Step 7: Let Classes_signature_DNA= S i ={ 

opt opt J opt 

Wij ’ CLi ’ Di } the set of data sequences and 

V= { v 1 , v2, . . . , vc } be the set of centers . 

Step 8: Randomly select ‘c’ cluster centers 

Step 9: Compute the distance between each data point and the 

cluster centers. 


285 


https://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 


International Journal of Computer Science and Information Security (IJCSIS), 
Vol. 14, No. 6, June 2016 


Step 10: Assign the data point to the group center whose 
distance from the class center is minimum of all the cluster 
centers. 

Step 11: Recompute the new class center using: 



Where, ‘c- represents the number of data points in i th cluster. 
Step 12: Recompute the distance between each data point and 
new obtained group centers. 

Step 13: If no data was reassigned then stop, otherwise repeat 
from step 10. 



Fig. 4 Proposed approach 


H. The Beta Wavelet Family 

The beta wavelet is used to construct the wavelet neural 
network of the proposed approach. The function beta is 
defined by P(x)=p x0 , x i,p,q(x) [26-27-28-32-33-41-45], x 0 and X! 
are real parameters. 


f${x,p,q,x^xf) = 




x-x 


Y 




VV x „ / 

0 




x-x 

1 

\xrx cJ 


if xe [x 0 ,x, 
otherwise 


( 21 ) 

We have proved, in [26-27-28-36-37-42], that all the 
derivatives of Beta function ^ L2(5R), are of class Coo(Fig. 4) 
and satisfy the admissibility wavelet condition. 

The general form of the nth derivative of Beta function is: 




n 

£c l n 

i= 1 


, /fr) , 

’ djP ' 


/ yji n\p | n\q 

^ ( \/i+l / \/i+l 

{x-xO) U“*) 


^ x yi (n—i)\p | (n-i)lq 


t \n+l—i ( \n+l—i 
(x-xo) (xi- x ) 


/3(x)+p n (x)p ] (x)/3(x)+ 
x P\(x)/3(x) 

( 22 ) 


where: 


PM) 



(23) 


Y = (-lV 


n ! p 


n\q 


(24) 


If p = q, for all n e N and 0 < n < p, the functions T^x) = 
d n p(x)/dxn are wavelets [26-27-29-34-38-39]. 



- 0.05 -IS 1 & * 

0 SO 100 0 £0 KB 0 50 10 C 


Fig.5 First, second and third derivatives of Beta function 
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III. Results and discussion 


This paper uses three datasets HOG 100, HOG200, and 
HOG300. Each dataset was randomly selected from 
HOGENOM[30], which comprehends homologous gene 
families from microbial species. Table 1 lists the details of 
these datasets. The HOG* dataset contains families that vary 
from 100 to 300. 


f 1 = £— maxU(-M)} (28) 

7=1 n i = 1 

Where the max is taken over all groups at all levels, and n is 
the number of the DNA sequences. 

Z denote the number of clusters in the whole grouping result 
and m denote the number of families in the dataset. 


Table 1. Datasets of microbial organisms DNA sequences 


Dataset 

# 

families 

#DNA 
sequences in 
the dataset 

Average length of a 
DNA sequence in 
the dataset 

Dataset 

size(MB) 

HOGIOO 

100 

9648 

1484 

15.1 

HOG200 

200 

22585 

1557 

37.0 

HOG300 

300 

27825 

1448 

42.6 


To evaluate the performance of the proposed approach, we 
have developed different experiments, each consisting of a 
different subset of test data. The classification comparative 
analysis is performed using a selection of published empirical 
datasets and synthetic DNA datasets [30]. 

Table 2. Distribution of Available Data into Training and Testing 
Set of DNA Sequence 


Dataset 

Total 

Training 

Test 

HOGIOO 

500 

300 

200 

HOG200 

600 

400 

200 

HOG300 

700 

600 

100 


Experiment results were performed to prove the 
effectiveness of our proposed approach. Evaluation metrics 
namely Precision, Recall and F-measures are used to compare 
our approach with other competitive methods. 

The F-measure combines the precision and the recall metric. 
We then calculate the recall and the precision of that cluster 
for each given class. More specifically, for cluster j and class i. 


1 ) Preferred length of the feature vectors 



Fig. 5 Clustering results of the wavelet-based feature vector (WFV) model 
against the length of the feature vector on different datasets. 


Fig. 5 shows that the wavelet-based feature vector (WFV) 
model could achieve the best clustering result when the length 
of the feature vector was 32 for HOG 100 and HOG200, and 
the next best when the vector was 64. However, the difference 
of classifying results was very small between 32 and 64. 
Consequently, 32 was the preferred length of feature vectors in 
our tests. As a result, a longer feature vector may not achieve a 
better clustering result. A shorter feature vector can reduce 
computation time, which is very helpful in processing large- 
scale DNA sequences. 


The recall of group j to family i is as follows: 


R ecall(j,i) = — 

n i 

n a 

Pr ecision(i, j) = — L 

n j 

Where n^ is the numbers of members of the class i in the 
class j, nj is the number of the DNA sequences of the group j 
and ni is the number of the sequence of the class i. 

The F-measure of the class j and the group i is the given by 


(25) 

(26) 



Length of feature vectors 


-HOG3QO 

-HOG200 

-HOGIOO 


Fig. 6 Clustering results of our method WNN-Lasso against the length of the 
feature vector on different datasets. 

F(j,i ) = (2*Recall(i, j) * Pr ecision(i, j)) / ((Pr ecision(i, j) + R ecall(i, j )) 


(27) 


For an entire k-means algorithm the F-measure of any group 
is the maximum value it attains at any node in the tree and an 
overall value for the F-measure is calculated by obtaining the 
weighted average of all values for the F-measure as computed 
by the following. 
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The WNN-Lasso model (Our proposed approach) achieves 
the best clustering result when the length of the feature vector 
was 128 on different datasets (HOG 100, HOG200 and 
HOG300). The last best when the vector was 32. However, 
the difference of classifying results was very small between 
32(The average F-measures=0.57) and 128(The average F- 
measures=0.59). Consequently, 32 was the preferred length of 
feature vectors in our tests. As a result, a longer feature vector 
may not achieve a better clustering result. A shorter feature 
vector can reduce computation time, which is very helpful in 
processing large-scale DNA sequences. 

2 ) Clustering results 




Dataset HOG3DO 


Fig. .9 clustering results in F-measure of the five alignment-free models and 
the WFV and the WNN-Lasso on dataset HOG300. 


We compared the WNN-Lasso model(our proposed 
approach) and the WFV with the five other alignment-free 
models, i.e., k-tuple, DMK, TSM, AMI, and CV. The WNN- 
Lasso model fixed the length of the feature vector to 128 for 
the all the DNA sequences on the three datasets(HOGKX), 
HOG200 and HOG300). For the other five models, the feature 
vectors were significantly affected by the size of the sliding 
window. The sliding window for TSM was 2, while for the 
rest it was 3. Since the length of each codon is three in the 
DNA, it might be beneficial to retain the genetic information 
of the DNA sequence. 


Fig.. 7 Classifying results in F-measure of the five alignment-free models and 
the WFV and the WNN-Lasso on dataset HOGIOO. 



Figures 7,8,9 illustrate the clustering results from all the 
models in the F-measure on the three datasets(HOGKX), 
HOG200 and HOG300). It is clear that the performance of 
WNN-Lasso was best. The WFV did not perform quite as well 
as WNN-Lasso, but was much better than other methods. 


Fig.. 8 clustering results in F-measure of the five alignment-free models and 
the WFV and the WNN-Lasso on dataset HOG200. 
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Table 3. The best clustering results in F-measure of the WNN-Lasso and the alignment based models on different datasets 



Our approach 

WFV 


K-tuple 


DMK 

TSM 


AMI 

CV 

Dataset 

F-measure 

# class 

F-measure 

#class 

F-measure 

# class 

F-measure 

# class 

F- 

# 

F- 

# class 

F- # class 











measure 

class 

mea 


meas 













sure 


ure 

HOG 100 

0.68 

1200 

0.58 

854 

0.52 


451 

0.55 

658 

0.48 

127 

0.42 

189 

0.48 564 

HOG200 

0.7 

879 

0.57 

845 

0.53 


566 

0.55 

754 

0.48 

840 

0.43 

412 

0.48 651 

HOG300 

0.78 

897 

0.58 

185 

0.52 


576 

0.54 

256 

0.48 

230 

0.43 

465 

0.48 542 


The table 3 shows the best classifying results in F-measure of the WNN-Lasso and the six alignment based models, i.e. K- 
tuple, DMK, TSM, AMI and CV., on the different datasets(HOOG100, HOG200 and HOG300). The six alignment based models 
are far worse than WNN-Lasso (our proposed approach) and the WFV. However, the alignment methods are designed for finding 
the moste similar the DNA sequences, namely, every sequence in the cluster must have similarity above a given identity 
threshold. 

3 ) Running time 


Table 4. Running time in seconds of each method on all datasets 


Dataset 

Model 

Length of feature 
vector 

Time of building 
feature vector 

Time of k-means 
clustering 

Total running 
time 

HOG 100 

Our Method(WNN-Lasso) 

128 

5.3569 

75.645 

81.0019 


WFV 

32 

8.4857 

102.2634 

110.7491 


K-tuple 

64 

7.9544 

763.5223 

771.4767 


DMK 

64 

30.2172 

1550.4054 

1580.6226 


TSM 

12 

32.7890 

576.7237 

609.5127 


AM 

4 

167.8232 

326.0293 

493.8525 


CV 

64 

24.5168 

2715.1169 

2739.6337 

HOG200 

Our Method(WNN-Lasso) 

128 

16.758 

350.664 

367.422 


WFV 

32 

20.5239 

645.8376 

666.3615 


K-tuple 

64 

19.6306 

3010.5426 

3030.1732 


DMK 

64 

74.3083 

7210.1629 

7284.4712 


TSM 

12 

82.2772 

2144.6954 

2226.9726 


AM 

4 

410.8531 

1059.0261 

1459.8792 


CV 

64 

60.3402 

9247.6313 

9307.9715 

HOG300 

Our Method(WNN-Lasso) 

128 

15.556 

854.656 

870.212 


WFV 

32 

24.1259 

1349.6459 

1373.7718 


K-tuple 

64 

22.5723 

5560.3319 

5582.9042 


DMK 

64 

85.1456 

13785.8160 

13870.9616 


TSM 

12 

92.8571 

3329.9724 

3422.8295 


AM 

4 

471.9211 

1577.3361 

2049.2572 


CV 

64 

69.3221 

16609.7183 

16679.0404 
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The proposed approach (WNN-Lasso) also improves the 
classification accuracy of the DNA sequences, but also 
shortens the running time. Table 4 shows the average running 
time of the all methods on the tree datasets of DNA sequences 
(HOOGIOO, HOG200 and HOG300). 

The WNN-Lasso’ feature vector building time was much 
shorter than the other models. It is more important that the 
clustering time of WNN-Lasso was the shortest, so WNN- 
Lasso is more suitable for enormous quantities of the dataset. 
The models (K-tuple, DMK, TSM, AM and CV) used in these 
studies range from sequence and structure alignments. A large 
set of sequences can be simultaneously compared using 
Multiple Sequence Alignment which is known to be NP- 
complete problems. However, full the applied models are still 
computationally very expensive and require significant 
computational infrastructure. Our goal is to construct 
predictive models and cluster that are highly accurate and 
interpretable. This approach can affect the time complexity of 
the similarity computation between the DNA sequences. So 
the complexity of The models (K-tuple, DMK, TSM, AM and 
CV) are higher than the WNN-Lasso and the WFV.ure vector 
building time was little than and it was much shorte than of the 
other models. 

Table 5 MSE of Approximation of the Signal for DNA using our 
method 


DNA sequence 
for each Class 

Seq. length 

MSE 

(Mean Square Error) 

Training 

Time(sec) 

HOG 100 

128 

0.009958 

9.3569 

HOG200 

32 

0.002582 

5.758 

HOG300 

64 

0.007231 

7.556 


First, during the approximation phase, our proposed 
approach tried to decompose the input signal for every 
sequence and then tried to reconstruct the input signal. The 
estimation of the performance of this phase was measured by 
the Mean Square Error (MSE). Table 5 shows that the Mean 
Square Error (MSE) obtained are low (0.002582) and the run- 
time increases relatively with the size of the DNA sequence. 
The results show that the size of the DNA sequence increases 
the time of the training phase. The time depends on the size of 
the DNA sequence. When the size is equal to 128, the training 
time is equal to 9.3569 seconds. 

The results show that the WNN can achieve very good 
prediction accuracy. The results of our approach (Wavelet 
Neural Network (WNN)) tested on empirical datasets show 
that accuracy outperforms the other techniques in terms of 
percentage of the correct species identification. Table 3, Table 
4 and Table 5 show the distribution of the good classifications 
by class as well as the rate of global classification for all the 
DNA sequences of the validation phase. 


IV. Conclusions 

In this paper, we have used a new method of training called 
Library Wavelet Neural Network Model (MLWNN). It is used 
to construct Wavelet Neural Network (WNN). The WNN is 
used to approximate function f (x) of a DNA sequence signal. 
Our proposed approach depends on the Power Spectrum, 
processing the DNA sequence signal. Applying this k-means 
classification enables us to group the similar DNA sequences 
according to some criteria. This classification aims at 
distributing n DNA sequences characterized by p variables 
XI, X2...Xp in a number m of subgroups which are 
homogeneous as much as possible while every group is well 
differentiated from the others. 

In our approach, achieve very good prediction accuracy. The 
results of our approach (WNN-Lasso) tested on empirical 
datasets show that accuracy outperforms the other techniques 
in terms of percentage of the correct species identification. Our 
proposed approach outperforms the other models (K-tuple, 
DMK, TSM, AM and CV) in terms of both the classifying 
results and the running time. 

This approach helps to classify organisms into different 
categories and groups which have significant biological 
knowledge and can justify the evolution and identification of 
unknown organisms. Simulation results are demonstrated to 
validate the generalization ability and efficiency of the 
proposed Wavelet Neural Network Model. These results have 
been realized thanks to many capacities listed as; 

• The capacity of Library Wavelet Neural Network Model 
(MLWNN) to construct the Wavelet Neural Network 
(WNN). 

• The capacity of binary sequence indicators Codification, 
Fourier Transform and Power Spectrum to process the 
signal of DNA sequences, 

• The capacity of the oracle properly of LASSO can 
guarantee the optimal structure of the WNN, 

• The capacity of the networks of wavelets in approximate 
of the functions real gives a complex, 

• Finally, a powerful tool and a pipeline to perform 
organisms classification are provided to the DNA 
sequences community. 
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ABSTRACT-Morphological analysis is the process of 
constructing and deconstructing the words of a language, the 
process is based on the basic grammatical units which are stem, 
prefixes, suffixes and infixes. Sindhi is rich in morphological 
features with a great variety of affixes. The problem for Sindhi to 
come into computerization is the large number of variants in its 
morphology. This complexity is created due to different positions 
of prefixes, suffixes and stems in the words. The automatic word 
segmentation system normally faces such embedded hurdles in 
Sindhi language. An algorithm is required with a capability of 
dealing with such issues for the segmentation of Sindhi words. In 
this paper, an algorithm is designed and implemented to resolve 
the problem of segmenting Sindhi complex and compound words 
into possible morphemes. The developed words segmentation 
system has been tested on a list of 109 compound words, 179 
prefix words, 1343 suffix words and 50 prefix-suffix words. The 
cumulative segmentation error rate of 5.02% is calculated. This 
system can also be used as pre-requisite in various Sindhi 
language and speech processing applications. 

Keywords — Sindhi Morphology; Morphological Analysis; Word 
Segmentation; Morphemes 

I. INTRODUCTION 

Each natural language carries its specific and peculiar 
mechanism for generation of the words and conversion of 
other words from the root words. Morphology is a branch of 
linguistics which purely deals with the study of language from 
scientific point of view, concerning with words and their 
constructive grammatical units. The breaking or constructing 
units are prefix, infix, stem and suffix. Two types of words, i.e. 
basic and secondary are found in Sindhi. The basic words 
cannot be broken up any more but the secondary words are 
breakable and devisable into complex and compound words. 
The complex words are in the class of secondary words and are 
built by combining prefix/stem/suffixes. Compound words are 
formed with the combination of at least two words [1]. 

Sindhi, an Indo- Aryan language [2], bears a high degree of 
similarity with modern-day Urdu, Hindi and some other 
languages of northwest Indian sub-continent. There is also a 
firm relation among Sindhi, Arabic and Persian which is a 
contact-induced loaning and borrowing of many words from 
on to the other. The script used for Sindhi in South Asian states 
like Pakistan and India is purely Perso- Arabic on predominant 
basis. Apart from these regions, the same script is used by 
Sindhi migrants who are settled across the world. The Persian 
is involved in Arabic script for the representation of several 
letters which are not present in Arabic but are required to 
represent some implosive, retroflex and nasal sounds. The 


inflections and derivations in Sindhi script are most frequently 
found with the use of prefixes and suffixes. This proves the 
richness of Sindhi in terms of morphology. The issue arises 
due to a large number of morphological variants in Sindhi 
which is yet to be analyzed and solved successfully. 

A compound word is usually formed by coalition of two or 
more simple words, i.e. $ (Rose), ol j (Day Night). 

Prefix words which are the part of complex or derivative 
words are built with the union of prefix and stem or root words 
like (a sin) is a primary word when combined with prefix 
^ (a prefix that shows opposite meaning) becomes A& ^ 
(innocent). In this, A& (a sin) is a free morpheme and ^ is the 
bound one [1]. Suffix words are the result of the combination 
of a root word and suffix, such as (understanding) is a 

primary word when combined with suffix 5 (a suffix that 
shows infinitive mood) becomes (to understand). There 

are also many words in Sindhi dictionary which carry both the 
prefix and suffix along the stem or root word. The example of 
such words is (Country) is a root word, in condition of its 
combination with prefix ^ (a prefix that shows the sense of 
far), it forms ^ (Abroad) and with addition of suffix j, 
then it turns into jj (Foreigner). 

The automatic segmentation of words into morphemes through 
computer is what we call computational morphology. The 
morphological analysis is of assistance for many Natural 
Language Processing (NLP) applications working with large 
vocabularies [3]. For instance, it is traditional to preprocess 
texts by returning words to their original forms, specifically in 
text retrieval in morphologically enriched languages of the 
world. In computational applications, morphological analysis 
is basically the segmentation of words into tokens morphemes. 
The analysis separates the stem (core part of word) from the 
prefix (the letter-addition in the beginning of word) or the 
suffix (the letter-addition in the end of the word). Moreover, 
different approaches and methods have been proposed and 
developed for morphological deconstruction of words. They 
include statistical language modeling [4] [5], lexeme-based [6] 
[7], rule-based [8], syllable-based [9] and corpus-based [10]. 

II. LITERATURE REVIEW 

For the past half-decade, many a great works have been 
published in the field of Sindhi linguistic applications, Rahman 
has worked Sindhi Morphology and Noun Inflections [1] in 
which he has discussed the variation of morphemes in nouns 
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with respect to the dialects used in Sindhi language. He has 
used addition, subtraction and replacement methods through 
which the basic morphemes are derived out in different forms 
due to the difference in the dialects. Apart from the 
computational perspective of the work, the grammatical 
discussion is also carried out like the numbers, genders and the 
cases of certain nouns. The conclusion of the research reveals 
that morphological construction of Sindhi language is either 
inflectional or derivational. 

Sindhi is a rich language in terms of the characters having 
various glyphs. Such characters do also change their form 
within script depending on their position or order in the text. A 
Sindhi tokenization model is proposed by Mahar [11] having 
three layers, each layer assigned a separate task. Similarly, 
Bhatti [12] has worked on the Sindhi tokenization and 
developed a Sindhi word tokenization model. He has 
implemented several algorithms processing the tokenization of 
Sindhi text into individual words. This way, they have built a 
corpus and a word repository for grammar checking method, 
Sindhi Spellings and other NLP applications. The issue is dealt 
with the first encounter of sentence boundaries and extracting 
each sentence into a separate list form. In this list, each 
element is a complete sentence. The next step is the 
segmentation of sentences into words. This segmentation is 
performed on the basis of hard and soft spaces are taken as a 
part of word. Thus, the soft spaces are ignored of 
segmentation. The final step includes the filtration of words, 
removal of special characters, converting word into a token 
and saving it after the validation is done. 

Sindhi is one of the Arabic script-based languages but its 
automatic segmentation application through morphological 
analyzer is yet unavailable. Though, Mahar [13] has developed 
four algorithms which possess the capability of segmenting the 
words into the root level, a higher degree of computational 
complexity regarding space and speed is the lapsing point of 
all of them. Due to its categorical function, Mahar ’s 
morphological analyzer uses the type of morpheme as its basis 
in each algorithm. Each algorithm works for a specific type of 
morpheme only so that process goes lengthy and slow for 
being an individualistic type. Therefore, a better and new 
algorithm is proposed for the segmentation of words into 
morphemes. 

III. SINDHI MORPHOLOGY 

Each language has its own grammar, foundation, and rules. 
Relatively, language is unique in its structure, function and 
application. For creating the awareness about morphology and 
analysis of words formation, some words are given in Table I. 
The first Sindhi word in Table I carries two morphemes: ann 
(£>l) Jaan The first morpheme is bound and the second is 
independent one. The slight change is notice in third word, the 
stem comes first and the addition at the second part. “>yj” is 
the root word whereas the added part 5 is a suffix which is 
entailed to a word to change its meaning and sometimes word 
class even [14]. 


TABLE I: Comparative Morphological Analyses 


| Sindhi Morphology 

Word 

First Morphology 

Second Morphology 


t> i 


4 -i 

1 


i 

i 

L> 


A. Bound and Independent Morphemes 
The bound morphemes are those smallest basic grammatical 
units which form their meaning when included in a word. 
Independently, they do not bear any meaning. Thus, the term 
suggests that they are bound with the words and do not stand 
independently having their meaning as a word. Consider the 
examples shown in Table II, the morphemes ft and j are the 
best examples of this type. Table III depicts the examples of 
Sindhi independent morphemes. 


TABLE II: Bound Morphemes 


Derivative 

Root 

Suffix/ Prefix 



6 

A? i 

i 

J 


TABLE III: Independent Morphemes 


Word 

Independent Morphemes 

Stem 


j i 


JL 

j* 

jl > 


B. Zero Morphemes 

There are several English words which are exactly identical in 
there different forms even. ‘Sheep’, ‘fish’, and ‘deer’ are some 
nouns which remain same in both plural and singular forms. 
Same is the case with some verbs like ‘spread’, ‘shut’ and 
‘put’. They remain same in their different forms of present and 
past. They are called homophonous. In both types of such 
words, whether nouns or verbs the phonological representation 
is zero. Therefore, these morphemes are known as zero 
morphemes. 

In Sindhi, no such types of morphemes are found [14]. 
Though, we may find some homographic words in this regard 
which do not change their structure for changing into past or 
plural, they change their sound because in Sindhi, some words 
can make plural just by changing their diacritics with the same 
set of letters. The examples are shown in Table IV. 


TABLE IV: Zero Morphemes 


Singular 

Plural 


4= 

$ 
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C. Root, Derivatives and Compound Words 
Sindhi does also contain the same word types alike English: 
root words, derivatives and compound words. These types of 
words are depicted in the Tables V and VI. The first words in 
the Table 5 show the variation of meaning only with no change 
in word class of the root word. The changing of the meaning 
into the opposite of the root word defines the nature of prefix 
(ju), which is used to attach with the word for making its 
negation or opposition. The formation of second word suggests 
the uniqueness of Sindhi morphology in which the only letter 
(I) is the suffix of the word. In addition to this, the letter (I) 
used as suffix does not affect only the formation and meaning 
of the word but also changes its word class from verb to a 
noun. The prefixes of the third and fifth words are also of the 
same kind as of the first. They all mostly change the meaning 
of the word into its negative or opposite. The fourth word 
contains the suffix (^p) which is used for the emphasis only. It 
does not change the meaning or the class of the word. 


TABLE V: Derivatives 


Prefix/Suffix 

Root Word 

Complete Word 

Ju 


£ J-lJu 

1 



V 



Lp 



Ji 




The first word in the Table VI contains two words as usual 
compound words do. The following words containing the same 
formation represent another property of such words which is 
the coalition of adjective and noun. Each of the words in the 
Table 6 is formed with one adjective and one noun. This 
endorses that most of the compound words in Sindhi possess 
the same nature in terms of their formation. 


TABLE VI: Compound Words 


Compound Word 

First Word 

Second Word 

j 


^Lj 





J*j> 



IV. DATA COLLECTION 

Corpus of language is inevitably essential for the 
computational exploitation. We have made the use of Sindhi 
corpus of 1, 05,733 words developed by Mahar [15], the 
sample of developed corpus is shown in Figure 1 . It subsumes 
the genres of music, arts, politics, environment, and other 
texts. The sources for the collection of information were 
magazines, books of different types and newspapers. The data 
was collected in HTML and PDF formats. Then, they were 
converted into the fair equivalent formats of texts. Table VII 
represents the comprehensive details in figures for Sindhi 
corpus. 


^ jr> <A & J 

g * ^ ? A bV 1 J: J J* j* Jk 

J g J/jj c ^ jJj y> i/i 6P </ (A jli* J l*> 

jf I cW f 0 a ^ jf I £■) J ^ 

Fig.l. Sample of Developed Corpus 


TABLE VII: Statistical Information of Sindhi Corpus 


Corpus Type 

Sentences 

Word Tokens 

Arts 

1897 

6884 

Sports 

1656 

7582 

Politics 

2590 

13351 

Environment 

1819 

7098 

Music 

3822 

15412 

Total 

11,784 

50,327 


A. Word Tokens 

For the representation of the statistical information of this 
corpus, the first step was taken to break the text into the 
sentences. The second was the segmentation of sentences into 
words. This way the words were given to the system and it 
retained 50,327 unique word tokens. These word tokens do not 
represent the number of words in the given corpus but each 
word makes a token regardless of how many times it is used in 
the text. 

Tokenization process is the segmentation of input objects of 
orthographic symbols into tokens [16]. This is the first 
prerequisite for NLP applications for these word tokens are 
then supplied to natural language processing applications for 
more computational processing. The word limits such as white 
space, digits, special signs and punctuation marks are useful 
for tokenization process. Apart from being useful, these 
sometimes also create complications in the process of 
tokenization. In this research, Mahar ’s tokenizer [17] is used 
which they proposed particularly for Sindhi language only. 
This model is composed of three layers which works 
consecutively one after the other as per the requirement. The 
implementation of the model, as done by Mahar, has also been 
imitated in this research work. 

B. Developed Lexicon 

A large lexicon is always required as a key component for the 
implementation of morphological analysis. It is, in general 
sense, a repository of words required to test the proposed 
algorithms. Hence, a lexicon for computational process is built 
with a collection of morphemes that are prefixes, suffixes and 
stems. 

In print and electronic media, as the most of the Arabic script- 
based languages are written or typed without a variety of 
diacritic marks required for exactness of the sense, so is the 
case with Sindhi. Therefore, the basic limitation is the 
requirement of a fully diacritized corpus in order to build a 
lexicon. This may create another issue of the availability of 
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different versions of the same word with different diacritics in 
the lexicon. The words, then, may cause a great ambiguity with 
reference to their vocalization and meaning as well. Therefore, 
it is crucially essential to save all the words with full diacritics 
in the lexicon. 

A lexicon having 50,327 words is built for the implementation 
of proposed algorithm. The lexicon is developed to segment 
Sindhi words into morpheme sequences. It has five tables and 
each table is used for the storage of separate type of word 
morphemes. The tables namely are root words, compound 
words, prefix words, suffix words and prefix- suffix words. 

The developed lexicon is called Lexicon of Sindhi 
Morphological Analysis (LSMA). It is peculiarly constructed 
for proper and exact segmentation of words in Sindhi text. The 
lexicon contains only secondary type of words taken from the 
corpus. Table VIII represents the manifestation of secondary 
words. 


TABLE VIII: Information of Secondary Words 


Word Types 

No. of Words 

Compound 

541 

Prefix 

893 

Suffix 

6713 i 

Prefix-Suffix 

247 

Total 

8394 ] 


V. SINDHI WORD SEGMENTATION ALGORITHM 
A word is constructed with letters in a particular sequence. The 
letters first build a morpheme which is the smallest 
grammatical unit of language. Morphological segmentation is 
a general method for disintegration of a word into the 
combination of letters. This combination is a morpheme and 
cannot be further disintegrated. The development of any word 
segmentation technique requires one to be well aware of 
already developed and established techniques in order to bring 
effectiveness to the system. 

A. Word Segmentation Technique 

During the literature survey of Arabic morphological analysis 
techniques, it has been found that three morphological 
approaches are mostly in use, i.e. Table Lookup Approach, 
Combinatorial Approach and Linguistic Approach. These 
approaches can also be used for Sindhi word segmentation into 
its possible morphemes. Many times these approaches have 
been used for Arabic, Persian and Urdu languages. As Sindhi 
language belongs to the family of these languages on the basis 
of its script and nature so it can be predicted that these 
approaches can stand useful for Sindhi. 

In this paper, Table Lookup Approach is used for the 
segmentation of Sindhi words into possible morphemes. This 
approach mainly relies on a considerably large set of tables in 
which Sindhi words are stored and found in natural texts with 
their morphemes. Morphemes are set in the forms of stem, 
suffix and prefix. A variety of words are found in a language, 

i.e. foreign words, functional words and proper nouns which 
require a unique place in the table. Multiple entries may also 
be found with the same structure which is due to the fact of 


different types of sense relations of words among them. The 
sense relations include homonymy, metonymy, synonymy, 
hyponymy and synonyms and antonyms. Few of these 
relations require a word to be spelt same but meant differently. 
These entries enable the system to be capable of dealing with 
multiple analyses of the words. 

The entries in these tables are stored in alphabetical letter. For 
the optimization of search through vertical and horizontal 
order, a hash table stands efficient and effective to be used. In 
addition to this, a compression or precision technique is also 
possible to be used effectively for the reduction of storage 
needs. Thus, it makes the morphological analysis quite simple 
by accessing hash table. 

B. Proposed Sindhi Word Segmentation Algorithm 
The lexicon driven approach is used for our proposed 
algorithm, therefore, a lexicon named LSMA is constructed 
that stores all possible morphemes, and the lexicon consists on 
five tables {Tl, T2, T3, T4, T5}. 

The database table Tl is constructed for storing all the possible 
root words. The database table T2 is constructed for storing the 
compound words with three column vectors T2= {Cl, C2, 
C3}. The column Cl is used to store the complete compound 
word, C2 is used for storing first word and C3 is used for 
storing second word. 

In lexicon LSMA, database table T3 is constructed for storing 
words having prefix morpheme, it has three column vectors 
T3={C1, C2, C3} where, Cl is used for storing prefix along 
with primary word, C2 is used for storing prefix morphemes 
and C3 is used for storing the primary word. 

The database table T4 is used for storing words having 
suffixes. It has three column vectors T4= {Cl, C2, C3}. Each 
column is responsible to store the segments of words after its 
breakage. Thus, Cl is used for storing words having suffix 
along with primary word, C2 is used for storing suffix 
morphemes and C3 is used to store primary word. 

The database table T5 is used for storing the words having 
both prefix and suffix morphemes at a time, this table consists 
of five column vectors T5= {Cl, C2, C3, C4, C5}, where Cl is 
used for storing the complete words having prefix and suffix 
morphemes, C2 is used for storing only prefix and C3 is used 
for storing only suffix morphemes, C4 is used for storing 
primary word and C5 is used for storing the primary word 
along with suffix morpheme. Prefix and suffix lexicon entries 
cover all possible concatenations of Sindhi prefixes and 
suffixes. 

Algorithm of Sindhi Word Segmentation 


1 . Input Sindhi Text 

2. Tokenize Input Text 

3. Store all word tokens into temporary array WORDTEMP 

4. Select words one by one from WORDTEMP 

5. Search Selected word from Column 1 of Table Tl //To 
check that it is a root word or not 

6. If search is successful then display message “This is a 
Root Word” and go to step 16 
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7. Else split selected word into characters and store them into 
temporary array CHTEMP 

8. Search selected word from Column 1 of Table T2 // For 
Compound Words 

9. If Search is successful then 

a. Repeat until either both words are successfully 
compared or any word is not found in Table T2 

i. Select characters consecutively from 
CHTEMP and append into VAR1 

ii. Search and compare VAR1 from 
Column 2 of Table T2 

iii. If search is successful then 

1. Concatenate remaining 

characters of CHTEMP and 
store into VAR2 

iv. Else go to Sub-step a 

v. Search and Compare VAR2 from 
Column 3 of Table T2 

vi. If search is successful then 

1. Display “First Word”, VAR1 
and “Second Word”, VAR2 

b. End 

10. Else search selected word from Column 1 of Table T3 
//For Prefix Words 

11. If search is successful then 

a. Repeat until both conditions are true or any 
morpheme is not found in Table T3 

i. Select characters consecutively from 
CHTEMP and append into VAR1 

ii. Search and Compare VAR1 from 
Column 2 of Table T3 

iii. If search is successful then 

1. Concatenate remaining 

characters of CHTEMP and 
store into VAR2 

iv. Else go to Sub-step a 

v. Search and compare VAR2 from 
Column 3 of Table T3 

vi. If search is successful then 

1. Display “Prefix”, VAR1 and 
“Root Word”, VAR2 

b. End 

12. Else search selected word from Column 1 of Table T4 
//For Suffix Words 

13. If Search is successful then 

a. Repeat until both conditions are true or any 
morpheme is not found in Table T4 

i. Select characters consecutively from 
CHTEMP and append into VAR1 

ii. Search and compare VAR1 from 
Column 3 of Table T4 

iii. If search is successful then 

1 . Concatenate remaining 

characters of CHTEMP and 
store into VAR2 

iv. Else go to Sub-step 1 


v. Search and compare VAR2 from 
Column 2 of Table T4 

vi. If search is successful then 

1. Display “Root Word”, VAR1 
and “Suffix”, VAR2 

b. End 

14. Else search selected word from Column 1 of Table T5 
//For Prefix-Suffix Words 

15. If search is successful then 

a. Repeat until all conditions are satisfied or any 
morpheme is not found in Table T5 

i. Select characters consecutively from 
CHTEMP and append into VAR1 

ii. Search and compare VAR1 from 
Column 2 of Table T5 

iii. If search is successful then 

1. Concatenate remaining 

characters of CHTEMP and 
store into VAR2 

iv. Else go to Sub-step a 

v. Search and compare VAR2 from 
Column 4 of Table T5 

vi. If search is successful then 

1. Display “Prefix”, VAR1 

vii. Split VAR2 into characters and store 
into array SUTEMP 

viii. Select characters consecutively from 
SUTEMP and append into SVAR1 

ix. Search and Compare SVAR1 from 
Column 4 of Table T5 

x. If search is successful then 

1. Concatenate remaining 

characters of SUTEMP and 
store into SVAR2 

xi. Else go to Sub-step viii 

xii. Search and compare SVAR2 from 
Column 3 of Table T5 

xiii. If search is successful then 

1. Display “Root Word”, SVAR1 
and “Suffix”, SVAR2 

b. End 

16. End 

The process of proposed algorithm starts with the input step of 
the text. The text can be input through two ways; it can be 
typed and produced to the system and can also be taken from 
the corpus of the language. Once the text is input, the process 
has begun. The input text is tokenized at the beginning of the 
process. The tokenization model of Mahar [18] has been used 
in this system. The tokenization sends the prepared tokens to 
an array called WordTemp. This array stored the word tokens 
so that they can be forwarded forth. The system then takes 
each token from WordTemp one by one and starts searching 
the match for the selected word. The first search is carried out 
in Tablet Column 1. If the search is successful, system displays 
the word as a “Root Word”. The process does not go further 
for the search is over and the match is found. This is because 
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we have stored the root words in Tablet Column 1 and the 
successful search witnesses the word as a root one. If the 
search is unsuccessful and match is not found, the control 
shifts to the next search step. Before moving to the next search, 
the system splits the word into separate characters that 
constitute it and stores them in an array called CHTEMP. 

VI. IMPLEMENTATION AND RESULTS 
After the details for the familiarization of our developed 
algorithm, the algorithm is taken into its application in the 
system. The application process is defined in this section along 
with the results received after the application. The results are 
not calculated at a whole but for the acute evaluation of the 
system, we have categorized the process into different parts. 
The system has been evaluated through separate classes of 
words i.e. prefix words, suffix words, prefix-suffix word and 
compound words. 

The performance of algorithm is evaluated by rating the 
correctly and incorrectly segmented words as given in [19]. 
Moreover, the segmentation error rate with each word class is 
calculated so that the vivid and transparent results can be 
obtained. These separate word class results will also help find 
the causes and issues that reduce the success rate of the 
system. This calculation standard is used under the influence 
of [19], Segmentation error rate (SER) is defined as: 

(Number of incorrectly segmented words/ total number of 
word) x 100 

A. Compound Words 

The main algorithm first makes it sure that the word is not a 
root one then it shifts to the search of the forwarded word in 
Table2 Column 1 for searching if the word if compound one. 
The successful search shifts control of main algorithm to the 
Module Compound Words. The process begins by taking in 
the split letters stored in CHTEMP one by one until VAR1 is 
formed by achieving a match from Column2 of this table. 
Once, the match is found and VAR1 is formed, the system 
generates VAR2 taking the remaining letters from CHTEMP. 
The forming of VAR1 requires a repetition process by 
appending letters one by one from CHTEMP. VAR2 is formed 
and it also requires a condition of must-match in Column3 of 
the table. When both conditions are fulfilled and VAR1 and 
VAR2 are formed the system displays the result by showing 
first word as VAR1 and second word as VAR2. 

After the selection of a word, the splitting into separate 
characters takes place and each character is selected one by 
one and all these characters are being appended and stored into 
a temporary generated variable. Then, system compares the 
contents of this variable with T2-C2. If the characters are 
matched properly, the concatenation of remaining characters 
starts and then these remaining characters and fed into another 
variable and again the comparison starts with T2-C3, in case of 
successful match, the system displays both words. For 
example, ol j (Day Night), each character is taken into 

process from right to left like j and it is compared with column 


C2, then ' is selected and both are appended together and again 
compared with C2. The system consecutively selects third 
character o and again all are appended and compared with C2. 
The successful search leads to the concatenation of remaining 
characters 5, j, j, &, j through the same procedure and 
comparison takes place with B3. After both conditions are 
fulfilled, words are displayed as wordl and word2 is ol j. 
In order to scrutinize and verify the system efficiency and 
performance, we took 109 words randomly for testing. These 
words were taken from training dataset of 541 words. The 
number of taken words stands 20% of training data. For 
experimental purpose, compound words were categorized into 
two classes; the words having a hard space in between like $ 
(Rose) and the words having no hard space like JJb 
(Every Moment). The gist of results is given in Table IX. The 
pictorial representation of word SER is given in Figure 2. 


TABLE IX: Segmentation Error Rate using Module 
COMPOUND 


Compound Word 

Classes 

No. of 
Words 

Correct 

Incorrect 

SER 

With Hard Space 

82 

82 

0 

0.0 

Without Hard Space 

27 

26 

1 

3.7 

Total 

109 

108 

1 

3.7 


The complication of compound words is observed during the 
process of morphological analyzer. It is particularly observed 
with the words having connecting letters in between the 
compound words. This leads to the erroneous depiction of 
morphemes in such situations. In addition to this, certain 
compound words have non-connective letters in between. They 
lead to another erroneously segmented morpheme for the 
remaining non-connecting letters in the second word form a 
word that has an entirely different meaning from the actual 
sense of the whole compound word. Thus, two erroneous 
morphemes are segmented by analyzer in this case. The 
situation leads to an increase in SER of the morphological 
analyzer. Due to these issues, the SER of the proposed 
morphological reached 3.7% with the compound words having 
no hard space in between and 0.0 with those having hard 
space. 
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B. Prefix Words 

The second step of search ends in two conditions; the word is 
compound and process shifts by executing Module Compound 
Word and second condition is taking the process to a search in 
Table3 Column 1. The successful search in this table starts the 
execution of Module Prefix Words. The PREFIX module 
receives a word from main algorithm as input and splits it into 
characters, system selects each character one by one and 
appends it into temporary generated variable and then 
compares the contents of this variable with T3->C2, if 
comparison is successful, then concatenates remaining 
characters and stores into another variable and compares it 
with T3->C3, if search is successful, then system displays both 
morphemes. For example, U (unloyal), the system selects 
each character from right to left like o and compares it with 
column C2, then selects character j and appends it as next 
character and compares with C2, if search is successful, then 
concatenates remaining characters j, <J, I and compares with 
C3, when both conditions are satisfied, then system displays 
prefix and root U j. 

The appending of letters and searching for a match in Column3 
is repeatedly performed till VAR1 is formed and match is 
sought out in Column3. The VAR2 is formed by appending the 
remaining letters together and the search is performed in 
Column3 of the table. Column3 has the root words in it. It is 
also understood that formulation of VAR1 extracts the prefix 
from the word and leaves the remaining letters which must 
form a stem and VAR2 as well. VAR1 is compared with the 
words stored in Column2 and VAR2 is compared with the 
words stored in Column3. After achieving both matches, the 
system shows the result as VAR1 “Prefix” and VAR2 “Root 
Word”. 

Evaluating the performance of this module, 179 words were 
randomly taken from the training dataset containing 893 
words. The words having prefixes are classified into three 
categories: (1) The prefix words showing the sense of negation 
like Ju (unlucky) (2) The prefix words showing the sense 
of adjective like cJ (matchless) and (3) The prefix words 
showing the sense of antonym (abroad). The summary 

of results is shown in Table X. The SER of negation, adjective, 
and antonym is depicted in Figure 3. 


TABLE X: Summary of Results using Module PREFIX 


Prefixes 

Classes 

No. of 
Words 

Correct 

Incorrect 

SER 

Negation 

68 

67 

1 

1.47 

Adjective 

97 

94 

3 

3.09 

Antonym 

14 

14 

0 

0.0 

Total 

179 

306 

10 

4.56 


The calculated results depict that the SER of Negation and 
adjective is higher than that of Antonym. Since the prefixes 
used to form a negative or opposite sense to that of the original 
meaning of the particular word can stand as a word separately 
with their own meaning. Such prefixes are also used as in 


individual word in Sindhi text. Therefore, morphological 
analyzer segments them as a separate word sometimes and its 
SER increases relatively. On the other hand, simple Antonyms 
having prefixes are segmented successfully with the SER of 
2.3% which is lesser than that of Negation and Adjectives. 



C. Suffix Words 

The SUFFIX module takes the word given as input and split it 
into characters, system selects each character one by one and 
appends it into temporary generated variable, and then, 
compares the contents from T4->C3, if comparison is 
successful, then concatenates remaining characters and 
compares it with T4->C2, if search is successful, then system 
displays both morphemes. For example, (All to all), the 
system selects each character from right to left like ^ and 
compares it with column C3, then selects character and 
appends it as next character and compares with C3, if search is 
successful, then concatenate remaining characters j, j and 
compares it with C2, when both conditions are satisfied, then 
system displays root word yw and suffix 

The process of this module begins with the input of separately 
stored letters of the selected word in CHTEMP. One by one, 
the letters are brought in till VAR1 is formed. After the 
formulation of VARI the module searches for its match in 
Column2. Column2 is responsible to store the root words 
therefore VARI in this module is the formulation of root 
words. The appending of letters and searching their match in 
Column is repeatedly done till its formulation and final match 
in Column2. After VARI, the module appends all the 
remaining letters and forms VAR2 which is a suffix and such 
type is stored in Column3. VAR2 is compared with the 
combination of letters stored in Column2 to find its match. 
After achieving the successful matches of VARI and VAR2 in 
their respective columns, the system displays result as VARI 
“Root Word” and VAR2 “Suffix”. 

The number of words taken randomly for testing from the 
training dataset was 1343. The total number of words in 
training dataset was 6713. The selected sample was taken in 
order to gauge the performance of this module. For 
experimental purpose, words with suffixes were categorized 
into 5 classes: (1) the suffix words in singular sense like 
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(Hungry) (2) the suffix words of plurality like I jb ^ (Jewlars) 
(3) the suffix words showing adjectival meaning like 
(Lucky) (4) the suffix words classed in masculine like 
(Boy) and (5) the suffix words of feminine like (Wife). 

The summary of results with the standard of SER is given in 
Table XI. The graphical representation of results is given in 
Figure 4. 


TABLE XI: Summary of Results using Module SUFIX 


Suffixes 

Classes 

No. of 
Words 

Correct 

Incorrect 

SER 

Adjective 

631 

622 

9 

1.43 

Singular 

112 

210 

2 

1.79 

Plural 

102 

99 

3 

2.94 

Masculine 

181 

179 

2 

1.10 

Feminine 

317 

312 

5 

1.58 

Total 

1343 

5,566 

47 

8.84 


The depiction of results proves Masculine class to be yielding 
the least SER in all. On the other hand Feminine class as well 
as Singular has acceptable level results with 1.79% and 1.85% 
SERs respectively. The cumulative SER is 8.84%. This is due 
to the Plural class of suffix words which stands with an SER of 
2.94%. Due to this class, the performance of whole system is 
affected and led to a higher level of SER. The reduction of 
SER in plural will ultimately improve the performance of the 
system. Eventually, besides Singular and Plural, the results are 
considerably better and encouraging as well. 



D. Prefix-Suffix Words 

The process of Module Suffix Words begins with the input of 
separately stored letters of the selected word in CHTEMP. One 
by one, the letters are brought in till VAR1 is formed. After the 
formulation of VARI the module searches for its match in 
Column2. Column2 is responsible to store the prefix 
morphemes therefore VARI in this module is the formulation 
of prefix morphemes. The appending of letters and searching 
their match in Column2 is repeatedly done till its formulation 
and final match in Column2 is found. After VARI, the module 
appends all the remaining letters and forms VAR2 which is the 
remaining part of the word containing the root and suffix and 


such type is stored in Column4. VAR2 is compared with the 
combination of letters stored in Column4 to find its match. 
After achieving the successful matches of VAR2 in its 
respective column, the system displays result as VARI 
“Prefix”. After producing the result of VARI Prefix, the 
system concatenates the VAR2 and split it into letters. The 
split form is stored in another array called SUTEMP. The 
selection of letters one by one from SUTEMP and appending 
them again starts till a SVAR1 is formed. After forming 
SVAR1, the system starts searching the match for SVAR1 
from Colum4 where primary words are stored. If system 
succeeds to find the match, it concatenates the remaining 
letters taken from SUTEMP and forms SVAR2. Then SVAR2 
is compared with the words stored in Column3. 

After finding the match of SVAR2 in Column3, the system 
displays the result as SVAR1 “Root Word” and SVAR2 
“Suffix”. It is noted that the concatenation and appending of 
the letters from TEMPs are repeatedly done till the search 
comes successful. The “Else” condition drives the system to 
jump to the previous step of concatenation and appending of 
letter and continues it till the match is found in the column. 

This module is based on two phases: system segments prefix 
and the stem in first phase and it cuts off suffix from the root 
word in the second. The word is appointed into the module 
from the main algorithm as input and concatenates it into 
separate characters. System takes each character one by one 
respectively and keeps appending them into a temporarily 
generated variable. While appending the characters it also 
keeps on comparing the contents of this variable from T5->C2. 
As the comparison comes to a successful match, then the 
remaining letters are concatenated and stored into another 
variable. Once more, the splitting and appending takes place 
and storing the characters into variable while comparing them 
with T5— >C4. Till the match comes successful during 
comparison process, then the rest of the letters are stored and 
the process repeats itself again undergoing each step that are 
already described. After the successful match while comparing 
the contents with T5— >C3, the system displays three parts of 
the word. 

For example, ^ ^ j (Foreigner), the system takes each 
character from right to left i.e. o and compares it the contents 
in C2, it selects j and appends to the previous character and 
again compares with C2, after successful search it concatenates 
the rest of the letters, s ^ and takes them through the 
same process. When the stem ^ .o is successfully segmented, 
it looks for the other characters j and does comparison with 
the contents of C3, after the fulfillment of all three conditions; 
system shows a display of prefix j j root word ^ .o and suffix 
l$. A list containing 247 words was prepared for training 50 
words. These words were tested through the system in this 
module. The outcomes are shown in Table XII. 


TABLE XII: Results using Module PREFIX-SUFFIX 


No. of Words 

Correct 

Incorrect 

SER 

50 

46 

4 

8.0 


300 
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E. Cumulative Results 

The developed morphological analyzer has been gauged in 
testing 109 compound words, 179 prefix words, 1343 suffix 
words and 50 prefix- suffix words. The overall results showed 
the SER of 5.02%. The calculated cumulative word 
segmentation error rate of different word classes is given in 
Table XIII. The Figure 5 depicts the cumulative segmentation 
error rate of the system in graphical form. 


TABLE XIII Cumulative SER of Each Word Types 


Types of Words 

Segmentation Error Rate 

Compound 

3.7 

Prefix 

4.56 

Suffix 

8.84 

Prefix- Suffix 

8.0 

Cumulative SER 

5.02 


The results show that compound words have resulted the least 
SER which is encouraging part of the work. The SER of these 
words is 3.7% cumulatively. Segmentation of suffix words 
produces an SER of 8.84% and the reason of its height is 
already described as the Adjectives with suffixes sometimes 
stand as completely separate words in Sindhi script. The SERs 
produced after suffix words and prefix-suffix words are at a 
little difference of 0.84%. 



Fig.5. Cumulative Words SER of Proposed Algorithm 


VII. WORD SEGMENTATION APPLICATION 
In spite of all the details given about the developed algorithm, 
its function, application and results, the need of more clarity 
remains intact for the understanding the whole research and its 
processed outcome through the system. The interface contains 
two boxes that are connected with the process of the given 
text. The upper box is responsible to show the text that is input 
into the system. This box not only accommodates the direct 
typing of the text but has a property of receiving an already 
developed file as its input. The system processes the text that is 
directly typed. In otherwise case, it receives the files which are 
in doc. format only. 

After the text is input into the system, the user has to click the 
Process Menu and a pop-up will appear in a drop-down box. 
The box has three options i.e. Apply, Data Setting and Clear. 
As the user will click the Apply button, this will activate the 


system to take the text for processing. The process ends up by 
showing the results in the output box of the interface. The 
depiction of input box and outbox are totally different in terms 
of the organization of the text. The input box takes the plain 
text as it is typed. The input and output box depicts the results 
in six different columns as shown in Figure 6. These columns 
have been assigned their respective morphemes. Each word 
from the text is processed and put into its respective column. 
The columns are given the names of the morphemes found in 
Sindhi language. Each column receives a particular morpheme 
taken out of the word after segmentation. 



VIII. DISCUSSION AND CONCLUSION 
Sindhi language has been considered as one the most complex 
languages when it comes to automatic language applications. 
The abundance of homographs in orthography of Sindhi and its 
cursiveness prove the above fact. Such nature embeds the 
hurdles in the way of word segmentation process. The affixes 
become rather complex to be segmented due to cursiveness. 
Thus, the developed word segmentation system is designed in 
such a way so that it should segment these basic grammatical 
units as an embarking source to NLP Applications. The 
proposed algorithm possesses the capability to deal with all the 
basic grammatical units of Sindhi: Root words, Prefixes and 
Suffixes and provides the base for segments its words into 
morphemes. 

The two data sets are extracted from our developed lexicon for 
experimental purpose: the training data set and the testing data 
set. The testing data set contains 109 compound words, 179 
prefix words, 1343 suffix words and 50 suffix-prefix words. 
After the process of words segmentation, compound words 
yielded the SER of 3.7%, the prefix words gave an SER of 
4.56%, and suffix words did 8.84% and prefix-suffix words 
8%. The individual calculation and cumulative segmentation 
error rates of the proposed algorithm derive out that the results 
have come up to the acceptable level. Although, 5.02% SER is 
produced, the correct segmentation supports the effectiveness 
of the proposed algorithm with an exactitude rate of 94.08%. It 
is a proven fact that Sindhi word segmentation is an essential 
for its application in any natural language processing task. 

The received results have achieved an acceptable level, 
though; they are not up to the mark as they should be. This 
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piece of research has paved a way to reach the ultimate [10] 
accuracy in NLP applications for Sindhi language. The current 
SER is surely possible to decrease in future; the achieved SER 
is a little high due to the limited lexicon. The SER can easily 
be decreased if the lexicon is extended to a great extent. The 
table lookup approach is used for automatic word [11] 
segmentation system. If the approached as combined at least 
two, the algorithm will be more useful for the same task. 

Hence, in future, we shall also test the combined system of 
combinatorial and linguistic approaches. [12] 
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Abstract — Most of the existing secret sharing schemes are based 
on polynomial interpolation. In other word, they 
use polynomial functions in their schemes. In this paper, we solve 
the problem of creating a secret sharing scheme based 
on rational interpolations. We show that if k > ma x{p, v) + 1 
support points have the same width then the rational 
interpolation of the support points, which is called (p^ ,v \x), has 
p. + v + 1 — k pole points. Finally, we give an example for the 
accuracy of the proposed scheme. 

Keywords-component; Secret Sharing Scheme; Shamir’s 
Scheme; Polynomial Interpolation; Rational Interpolation , Pole 
Points. 

I. Introduction 

Secret sharing schemes are important tools in modern 
cryptography. Some of applications of secret sharing schemes 
are access control, secure key management systems, secure 
multi-party protocols, electronic voting and etc. [1, 2]. In other 
word, secret sharing schemes can be used for any situation in 
which the access to an important resource has to be distributed 
over several parties. The case of opening bank vaults or 
launching a nuclear missile are both situations in which a secret 
sharing scheme can be utilized [20]. 

A secret sharing scheme is a method to distribute shares of 
a secret (called shadows) among a set of participants which is 
called P by giving each participant a share in such a way that 
only certain pre-specified subsets of P are qualified to recover 
the secret while any unqualified subset of P cannot do the same 
thing. 

The first ( t , n) -threshold secret sharing schemes were 
introduced by Shamir [3] and Blakley [4] independently 


based on Lagrange interpolating polynomial and linear 
projective geometry respectively. Jackson et al. [18] extended 
the (t,n)~ threshold secret sharing scheme to the multi-secret 
case, named {k, t, ri) -threshold multi-secret sharing scheme, 
which holds the robustness property and confidentiality 
property. Robustness means that the shared k secrets can be 
recovered when t or more secret shadows are pooled, and 
confidentiality means that the shared secrets cannot recovered 
when t — 1 or fewer secret shadows are pooled. In verifiable 
multi-secret sharing, there are multiple secrets to be shared 
during a secret sharing process, and any cheating by a dealer or 
by participants can be detected. 

In the generation step of Shamir’s secret sharing scheme the 
dealer chooses /(x), a polynomial of degree n, and computes 
the secret shares 

Vi = ( X;, /(Xj)) for i = 0,1, 

In this paper, we analyze the plausibility of secret sharing 
schemes based on non-polynomials with achieving the 
properties of a perfect secret sharing scheme. Note that we 
suppose that all devices have the same precision without less 
than generality. 

The outline of this paper organized as follows: In section 2, 
we present some preliminaries consist of interpolation and 
Shamir’s secret sharing. Section 3 is devoted to introduce the 
Shamir’s secret sharing scheme based on non-polynomial 
interpolation. Finally, we present an example of the scheme 
in section 4. 
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II. Preliminaries 

A. Interpolation 

Interpolation problem is one of the most important topics in 
applied mathematics. 

Definition 1. Consider a family of function of a single 
variable x, 

(p(Xj CLq, CL^, , CL-ji) ’ 

having n + 1 parameters a 0 ,a lf ...,a n , whose values 
characterize the individual functions in this family. The 
interpolation problem for cp consists of determining these 
parameters a t so that for n + 1 given real or complex support 
points ( x if fi ) the relation 

(p(Xi, CLq, Ci ... , CLji) fi 

holds for i = 0,1, ... , n where x t + x k for i =£ k. 

Interpolation problem is called linear interpolation problem 
if cp depends linearly on the parameters a t s, i.e. 

cp(x, a 0 , a 1( ...,a n ) = a 0 <p 0 (x) + a^ix) + •■• + a n <p n (x) 

where 

<Po(x),<Pt(x), -,<Pn(x) 

are functions of variable x. This class of problems includes 
the classical one of polynomial interpolation in form of 

cp{x, a 0 , a lt ... , a n ) = a 0 + apx + a 2 x 2 + — I- a n x n 

as well as trigonometric complex interpolation in the form of 

cp{x, a 0 , a lf ... , a n ) = a 0 + a x e xl + a 2 e 2xi + — I- a n e nxi . 

The class of linear interpolation problems also contains 
spline interpolation. The class of polynomial interpolation can 
be solved by Newton’s divided differences. The rational 
interpolation problem is defined as follows: 

Definition2 .Consider the rational function 

<pO,v)( x ) _ r ( x )/s(x) 

such that p + v = 7i, 

r(x) = a 0 + apx + — I- a M x M 

and 

s(x) = b 0 + b x x + — I- b v x v 

are two polynomials of degree p and v respectively. Rational 
interpolation problem is the determining n + 2 parameters 

a 0 , ci\ t ■■■ t a^, b^, b^, ... , b v 

of (p^ ,v \x ) such that (p^’ v \xi) = using n + 1 support 
points ( x if fi ), for i = 0,1, ...,n. 

In Definition 2, if (p^ ,v \xi ) + then the support point 
(Xj,/j) is called a pole point. Berrut and Mittelmann suggested 
a method to avoid poles by using rational functions of higher 
degree [21]. Some mathematician surveyed various aspects of 
this kind of interpolation and proposed several algorithms. For 
more details see [22-24]. 


Identify applicable sponsor/s here. ( sponsors ) 


B. Shamir ’s secret sharing scheme 

A secret sharing scheme consists of a dealer and n + 1 
participants P 0 , P 1 ,...,P n . The dealer computes secret shares 
and distributes them to the participants such that a qualified 
subset of participants can recover the secret. In fact, a secret 
sharing scheme consists of three steps: generation, distribution, 
and reconstruction. In the thegeneration step, all the necessary 
data are computed. In fact, the dealer computes the secret 
shares such that any information about secret does not leak. In 
the distribution step, the dealer distributes secret shares to 
participants. Finally, in the reconstruction step, only a qualified 
subset of participants can retrieve the secret. In a verifiable 
secret sharing scheme, the participants can verify their 
shares. In addition, in some schemes the dealer can distinguish 
malicious participant. Suppose that S denotes the secret. The 
Shamir's ( k,n + l)-threshold secret sharing scheme is defined 
as follows: 

Definition 3. The Shamir's (k, n + 1) -threshold secret 
sharing scheme consists of three algorithms: 

• Generation: The dealer chooses the integer numbers 
x 0 , x lf ... , x n , the prime number p and a function /(x) 
of degree k such that 5 = /( 0) mod p. Then he/she 
computes the secret shares s t = (x^f^Xi) mod p) for 
i = 0,1 , , 7i. 

• Distribution: The dealer distributes s t to P t for 
i = 0,1, ... ,n. 

• Reconstruction: Any k members of participants 
retrieve the secret using polynomial interpolation of 
their shares. 

III. Pole Points 

In this section, we prove the existence of pole points in the 
rational interpolations. 

Theorem 1. Let P = {(x^yj), 0 < i < n} be the set of 
support points. Suppose that k is the number of support points 
with the same width (1 . Then the rational interpolation function 
<p(M+)(x) of P is /?, if max{p, v] + 1 < k where p + v = n. 

Proof. It is enough to show that if max{jU, v} + 1 = /c, then 
<p(+,v)(x) = /?. We find the rational interpolation function 

<P M,V 00 = (a 0 + ••• + a M x M )/(h 0 + ••• + b v x v ) (1) 

using 7i + l support points 

{(*0, ■■■ > (.%k> Tfc)' ■■■ > (Xji> 3+)} 

where 

maxjjU, v} + 1 = k, yj =£ (3 (k < j < 7i) and p + v = 7i. 

We substitute these support points to relation (1) as follows: 
r + b ± x o + ••• + b v (x 0 ) v ) = a 0 + ••• + a M (x 0 ) M 

^ p(b 0 + b x x K _ t + ••• + byix^Y) = a 0 + ••• + a^x^Y 

y K (b o + b x x K + ••• + b v (x K ) v ) = a 0 + ••• + a M (x K )^ 

, y n (b o + Wx n + ••• + by (x n ) v ) = a 0 + ••• + a M (x n ) M 
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The above relation can be written as AX = 0 in which 

^ and 


A = X = ( b 0 , ... , b v , a 0 , ... , a,y 


A i 


A , = 


y? Px 0 

P P*2 
\P Px K - 1 

-1 -X 0 

— 1 — 

-1 -X„_1 


A (^o) V 
A (* 2 ) v 

P (X K -lY 


KX(V + 1) 


Cr 


C, = 


i4 3 — 


yk*k 

Xk+i Xk+iXk+1 

y n y-R^n 

-1 X K 
-1 Xv + 1 


- -Oo )" 

_ (^K-l)' 1 / KX(lU + 1 ) 

y K (* K ) 

y K +ife+i) 


(n-jc)x(v+l) 


(n-fc)x (/*+!) 


px 0 - 

P (x 0 y\ 

Px 1 - 

p (*l) v 

P*v 

P(x v y/ U 

1 -X Q 

- -(* 0 r 

1 —Xi 

-(x } y 

1 -Xy 



( 10 ) 


(v+l)xO+l) 



c, = 


c = 


/£ )5x v+1 

P Px V +2 
P P*n 

— 1 —%v + l 

-1 -^v+2 

-1 ~X„ 


P (x v+1 ) v \ 
P (*y + 2) V 

/? (x M ) v 


(M-v)x(v+l) 


-(*y +2 r 


(ll-v)X(ll+ 1) 


We show that C is equivalent to 


We have three cases: I) v = II) v < ii and III) v > fi\ 
I) v = ji: In this case, consider the matrix 


C = 


Ci C 2 C 3 

Q c 3 c 6 


C = [A t \A 2 ] 

fP - P (*o ) V 

P /? (*i) v 

- HXk-iY 


~Xq 

-Xi 


-(X 0 ) V 

-fe) V 


in which 

Ci = /^(v+l)x(v+l)> C 2 = ~ f(v+l)x(v+l) > C 3 = 0 (v+l)x( i u_v) > 
^4 — ^(yU-v)X(v + l) > Qj — ^(jU-v)X(v + l) » Cfr — I( J u-v)X( J U-v)’ 


-1 — X, 


7C-1 




and the matrix F = (/?/ KX(v+1) | -l K x(v+i)) KX(2v+2) 

every row of matrix C is a linear combination of the rows of 
matrix F. Therefore, the matrix A is equivalent to the matrix 
(pi -i 


Since the matrix [C X |C 2 ] is equivalent to the matrix [C x |C 2 ] 
Kx( 2 v+ 2 ) us i n g case I), the matrix C is equivalent to the matrix 

. Note that _ / r f q< 


~, = (C X C 2 C' 3 \ 
J VC 4 c 5 C 6 J 


(PI -I) 

U 3 aJ 


using elementary row operations. In other 

13 ^ 4/ 

words, AX = 0 is equivalent to AX = 0 . The first k rows of 
AX = 0 yield a t = pb t for i = 0,1 , ..., v. Therefore 


Since any row of [C 4 \C S ] is a linear combination of rows of 
[Ci |C 2 ], the matrix C is equivalent to the matrix 


C" 


cp v,v (x) = 


a 0 + a x x H 1- a v x v 

b 0 + b t x + — I- b v x v 
/3b 0 + pb ± x H h pb v x v 


C\ C 2 C 3 

0 0 c 6 

Cn C, 


b 0 + b ± x + — I- b v x v 


= P- 


(C 2 c 3 \ 

Moreover, the matrix V = I ^ ^ I is invertible, because 

it is Vandermonde’s matrix. Therefore, the matrix C is 
equivalent to the matrix 


II) v < fi : In this case k = maxjjU, v} + 1 = /i + 1 
Consider the matrix 

c = [AMz\ = 


C" — ( c± c 2 

VO 0 I ) 


■p 

Px 0 •• 

•• /?(*o) V 

-1 -x 0 •• 

• -c*o y 

p 

Px 1 •• 

;• /? (^i) v 

— 1 — x x 

■ ~(x t y 

p 


•• p (x,y 

-1 •• 

• ~(x,y 


in which the matrix 7 ( ' jU _ V ) X ( m _ V ) is an identity matrix. The 
matrix C'" is equivalent to the matrix 


q — ( Ci c 2 oV 

Vo 01 ) 


We rewrite C as C 




in which 


(^+i)x(^+v+ 2 ^y elementary row operations. Finally, the first k rows of 
CX = 0 yield 

/ 3b 0 + /?h x x H — (3b v x v + 0x v H h 0x M 

<p(x) = — — — — = B. 

J b 0 + b t x + — I- h v x v H 


305 


https://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



International Journal of Computer Science and Information Security (IJCSIS), 
Vol. 14, No. 6, June 2016 


Case III) v > ir. This case is similar to previous case. 

□ 

The following corollary shows the existence of pole points 
in a rational interpolation. 

Corollary 1. Let P = {(x^y^), 0 < i < n} be the set of 
support points. Suppose that k is the number of support points 
with the same width /?. 

i) If k > maxfg, v} + 1 then the rational 

interpolation of the members of P has at least 
n — k = [I + v — k pole points. 

ii) If k < maxfg, v} + 1 then the rational 

interpolation of the members of P has not any pole 
point. 

This corollary shows that the rational interpolation function 
(x) of the points 

p = {(*„, /?), ... , (>*-!, /?), 0 K . y K )> ■■■’ ( *n , y n )} 

has not pole points if max{ i u, v] > k. 

IV. The proposed scheme 

We know that the secret 5 and the values 

X 0 ,X 1 ,...,X n , f Oo). /Ol). - . /On) 

are integer numbers in Shamir’s secret sharing scheme. Also, 
the dealer is constrained to choose a polynomial function f (pc) 
in which the degree of function f(x) is n. In this section, we 
use Shamir’s (n + l,n + 1) -threshold secret sharing scheme to 
introduce a new (n + 1, n + 1) -threshold secret sharing 
scheme based on rational interpolation. Suppose that cp^ ,v (x) is 
a rational function in accordance with Definition 2. We propose 
a new (n + 1, 7i + 1) -threshold secret sharing based on 
rational function as follows: 

• Generation: The dealer performs the following steps: 

(1) Choose two integer numbers ji and v such 
that and fi + v = n. 

(2) Choose n + 1 integer pairs (x^y^) for 
i = 0,l,...,n such that the condition of 
Corollary 1 is satisfied i.e. k < maxjjU, v) + 
1. 

(3) Construct the rational interpolation 
(p^’ v \x) = r(x)/s(x) using the pairs 
(x if yi) , i = 0,1, ... ,n such that r(x) and 
s(x) are two polynomial of degree /i and v 
respectively. 

(4) Choose and publish the number w where 
w ^ x t for i = 0,1, ...,n. 

(5) Choose and publish the number a where 
a = S + (p^’ v \w ). 

• Distribution : The dealer performs the following 
steps: 

(1) Publish the numbers fi, v, w and a. 


(2) Distribute the pairs (x t ,yi) to participant P t 
for i = 0,1, ... ,n. 

• Reconstruction: If all of participants pool their shares 
then they can retrieve the secret by and performing the 
following steps: 

(1) Compute the rational interpolation (p^ ,v \x) 
of shares. 

(2) Compute S = a — cp^ ,v \w). 

Since we use the concept of Shamir’s secret sharing, thus 
the proof of reconstruction phase is obvious. Note that 
Shamir’s (n, n) -threshold secret sharing scheme based on 
polynomial functions will be obtain by choosing 

v = 0, a = 0,w = 0,fi = n and S = (p n,0 ( 0) 
in the proposed scheme. 


V. Example 

Suppose that the dealer want to construct (4,4) -threshold 
secret sharing scheme in which n = 3 and 5 = 12. The dealer 
constructs (3+l,3+l)-threshold secret sharing scheme by the 
following steps: 




Generation: The dealer performs the following steps: 


(1) Choose two integer numbers (1 = 1 and 
v = 2 such that and fi + v = 3 . 

(2) Choose 7i + 1 = 4 integer pairs 

(5, —1), (1, —4), (— 1, —2), (2, —3). 

Note that k = 0 < maxjj u, v} + 1 = 3. 


(3) Construct the rational interpolation function 

-2x- 14 


<P 


( 1 ' 2 ) 0 ) = 


4 - x + x 2 

using the integer pairs 

(5,-1), a -4), (-1,-2), (2, -3). 

(4) Choose and publish the number w = —2 
where w =£ x t for i = 0,1, ...,3. 

(5) Choose and publish the number a where 

a = 12 + 2) = 11. 


• Distribution : The dealer performs the following 

steps: 

(1) Publish the numbers /i = 1, v = 2, w = —2 
and a = 11. 


(2) Distribute the integer pairs 

(5,-1), (1,-4), (-1,-2), (2, -3) 

to the participants P 0 ,P lf P 2 , P 3 respectively. 

• Reconstruction : If all of participants pool their shares 
then they can retrieve the secret by and performing the 
following steps: 

(1) Compute the rational interpolation function 
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<p (1 ' 2) (x) = 


-2x- 14 
4 — x + x 2 


using the integer pairs 

(5,-1), (1,-4), (-1,-2), (2, -3). 

(2) Compute the secret as follows: 

S =11- <p (1 ' 2) (— 2) = 11 + 1 = 12. 

VI. Conclusion and future works 

Our purpose was to implement Shamir’s secret sharing 
scheme using rational interpolaions. We define a new (n + 
1, n + 1) -threshold secret sharing scheme based on rational 
interpolaions. We showed that the number of pole points 
depends on the number of support points that have the same 
width. In other word, it is proved that the rational interpolation 
function cp^ >v ^ is the constant function /? , if the number of 
support points, which have the width /? , is greater than 
max{g,v}. We showed the efficiency of our scheme by giving 
an example. There is still a lot of work to be done in order to 
improve the capabilities of the scheme: it would be good to 
find a ( k , ri) variant of the scheme with k ^ n and a way to 
make it multi-secret (to allow sharing several secrets instead of 
one secret shared on each round). 
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Abstract — Although there are various biometric techniques, 
like fingerprints, iris scan as well as hand geometry, the most 
efficient and widely-used one is face recognition because it is 
inexpensive, non-intrusive and natural. 

In our paper, we present an approach aiming at implementing 
a full architecture which represents an efficient system of face 
recognition.For this, an attempt is proposed for each system stage. 
At the beginning, we develop a novel approach to detect faces 
existing in 2D color image. This approach focuses mainly on 
how to implement a selection of skin color before using neu- 
ral networks and Gabor filters. This approach represents an 
improvement of existing approach especially because it aims to 
minimize the computation time. Indeed, the skin detection step 
avoids wrong detection and to help the system detect the face in 
the right areas and minimize the research time and subsequently 
the Gabor filter will be applied only on the localized skin space. 
Later, the face features obtained by the Gabor filter represent 
the input of the neural network classifier to decide whether an 
input image pixel is a face pixel or not. 

For 2D face recognition, we propose likewise a novel approach 
that we call HMMLBP (a combination of the two tools Hidden 
Markov Models HMM and Local Binary Pattern LBP). It 
allows classifying a given 2D face image through utilizing an 
LBP tool to extract features. In order to validate our whole 
system performance, we show experimental results obtained when 
applying our proposed algorithm on benchmark face databases, 
respectively AT&T, Yale and Feret. 

I. Introduction 

The invention of computer which is able to save and 
read a huge amount of information was behind the emer- 
gence of some digital biometric systems [1][2], such as face 
recognition, a widely-used technique [3][4][5]that has been 
dealt with by many research studies [6] whose applications 
contain systems of automated surveillance, reconstruction of 
faces, access control, monitoring of security, identification of 
mug shot, designing human computer interfaces, diagnosis of 
diseases, communication through multimedia tools, suspect 
versus perpetrator verification, planning of treatment and iden- 
tifying missing and victim individuals. 

As depicted in (Fig. 1), the system of face recognition is 
divided into three steps: face detection, feature extraction and 
finally face recognition. 

Any system of face recognition starts with detecting faces in 
a given image. 

Obviously, it is the method of detection which makes the 


Face 

detection 


Feature 

extraction 


Face 

recognition 


Fig. 1. Steps of face recognition system [7] 


recognition system efficient. Generally speaking, a face de- 
tection system can decide whether the image contains a face 
or not and returns the location and the extent of each face in 
the image if one or more faces are present. 

Face detection in the image is considered very difficult due 
to the variation of color, position, orientation, lighting, facial 
expressions (smiles, anger....) and some morphological fea- 
tures (mustache, beard, glasses...). All these obstacles prevent 
the system effective detection and result in a decrease in its 
detection rate [8]. 

After a face has been detected, the task of feature extraction 
is to obtain features that are fed into a face classification 
system. Feature extraction is also a key to the animation and 
the recognition of facial expressions because the performance 
of the whole system depends on it. In this step also known 
as indexing or modeling, is extracted from the detected face 
image, a characteristic vector (signature) that is sufficiently 
representative of a given face and which models the much 
more precise than the raw image departure. 

This new representation of the face must have both the 
uniqueness property for each person and the property of 
discrimination between different people. Depending on the 
type of classification system, features can be local features, 
such as lines or facial features (eyes, nose, and mouth, etc....). 
Face detection may also employ features, in which case fea- 
tures are extracted simultaneously with face detection. Finally, 
recognition means the authentication and identification. The 
latter includes comparing one face with many other ones to 
fetch unknown identity from a set of known possibilities. The 
former consists in comparing one face with another to prove 
the identity claimed. Besides, recognition is strongly linked 
to classification in which the major issue is the identification 
some persons having a number of common characteristics. 

A. Overview 

Our paper introduces an approach dealing with such issues. 
Its objective consists in implementing a system of recognizing 
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2D faces tolerating the aforementioned problematic. In order 
to implement this system, we have proposed many solutions 
for different face recognition steps. The rest of the paper is a 
discussion of our face recognition system implementation. It 
also presents some experimental results. 

The remaining parts are the following: section 2 illustrates a 
number of related studies dealing with recognition of face as 
well as its detection. In the third section, the proposed face 
system architecture is depicted. Then, in the fourth section, 
the step of face detection, which relies on Gabor filter, skin 
detection as well as neural network, is explained. After that, 
the step of classification and that of decision are presented. The 
sixth section enumerates some experimental results obtained 
by utilizing AT& T, Yale and Feret data stores. 

II. Related Work 

A. Face detection 

Face detection approaches aim at automatically locating the 
position of the face in a given digital image. In this part, we 
are interested in the major and most essential face detection 
techniques. The latter permit the detection and localization 
of the real position of the face. The performance of these 
approaches relies on the rate of detection. We may distinguish 
four major categories of face detection techniques: 

1 ) Image-based approaches: Actually, these approaches are 
methods of classification having an algorithm of training. 
Generally speaking, they use automatic techniques of learn- 
ing. Besides, they treat the classification issue as the major 
detection face problem. The two most famous approaches of 
detection, involving two important phases: test and learning, 
are SVM (support vector machine) [9] and neural network 
[10] [11] [12]. The latter uses a classifier for the detection of the 
face various sizes in a given image. Prior to the network input, 
the image should be pre-processed by means of histogram 
equalization, and then scanned using an 18 * 27 window. 

It is obvious that the above-mentioned technique, which can 
overcome the problem of noise, is a difficult-to-construct clas- 
sifier. Added to that, considering the algorithm of detection and 
the methodology, SVM is different from the neural network. 

2 ) Features based approaches: They are utilized to localize 
the face. They are sub-divided into two categories. The first 
one, based on detecting the skin[13], uses the color to effec- 
tively detect, in the image, the chromatic areas. 

Thus, it allows solving the problem of scanning the image, and 
identifying if the pixel represents the color of the skin or not. 
This method, where the color of the skin gives a clear idea 
about the scale and variation in orientation, constitutes a so 
efficient and time-saving technique. As shown in [14] [15], the 
second sub-category relies on the invariant features of the face. 
It utilizes a hypothesis in order to localize the top of the face, 
and later on browse it to distinguish the eyes marked with a 
sudden rise in the contour density computed by a black/white 
percentage through browsing the horizontal obtained planes. 

3) Template Matching approaches: These techniques are 
normally based on measures of similarities. Generally, the 
pixels intensities between a predetermined template and many 


image sub-areas must be analyzed. This approach is based 
on the computing and measuring of the relation between the 
template image and the candidate one; that is why, these 
methods utilized many templates [16] [17]. 

4) Knowledge-based approaches: They deal with features 
of the face (lips, eyebrow, nose, etc) and determine the 
correlation between them. Thus, in [18], the author utilized 
rules in which he located the facial features through the 
technique of projection introduced in [19] .In [20] the author 
used face image resolution. 

It is easy to extract the image facial features and their 
relationships. However, such techniques are influenced by the 
variation in pose. Besides, we should take all images from the 
front. 

B. Face recognition 

It is a wide and very interesting area of research[l 1] [1] [14] 
[21]. The process of face recognition involves 4 main steps as 
presented in (Fig. 2). 



Fig. 2. Face recognition general process 


In [22], the author defined coding as the act of taking the 
face image from the concrete world in which a face is con- 
sidered as a dynamic identity which changes constantly under 
several factors influence such as lighting and illumination. In 
this stage, this image is transformed into presentation of gray 
level. It is, then, pre-treated. This phase is so important as it 
ameliorates the quality of the image (elimination, detection, 
noise, head position, etc ). 

The analysis, also named the extraction of the image 
features[23], is the next step. It is. This phase is to extract 
relevant information which will be stored in the memory. After 
that, and during the learning step, the realized peoples obtained 
representations were saved. At last, the whole process finishes 
with the step of decision. The latter constitutes an estimation 
of the difference calculated between a couple of images. 
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During the last 30 years, many face recognition approaches, 
using 2D images have been introduced [4] [11]. Among these 
proposed methods, we distinguish 3 main categories: hybrid 
approaches, global approaches and local ones. 

1) Global approaches: Such methods, like Fisherface [24], 
Eigenface [25], SVM (Support Vector Machines) [26] and LDA 
(linear discriminant analysis) [27], utilize informations given 
by the face without being segmented. For these methods, each 
face image of the database, presented by a matrix of n lines 
and m columns, is first transformed into a vector (n x m, 1) 
by concatenating the columns of the matrix. Then, thus vector 
is an input for the classifier. 

These methods are generally sensitive to pose and illumina- 
tion. Furthermore, they necessitate a memory of very large 
size. 

2) Local approaches: The second category, called local 
methods such as HMM [28], elastic Bunch Graph Matching 
[29] and [30], are based on models.These methods use prior 
knowledge taken about the facial morphology and rely on its 
local characteristic points. They also present other techniques 
which take into account the non-linearity by constructing a 
local features space and using the filters of appropriate images. 
Generally, this category of approaches is less affected by 
variation in pose, illumination, and facial expression. But, they 
are more difficult to implement. 

3) Hybrid approaches: As example, we may mention 
Genetic Programming-PCA[31], PCA-Gabor [32], etc They 
benefit from the global and the local methods through com- 
bining the extracting the features of the local appearance with 
detecting the geometrical characteristics. Such solution makes 
possible to rise the recognition performance stability during 
lighting, the variations in facial expressions and position. 
Along with the evolution of 2D face recognition field and due 
to the growing development of 3D information acquisition and 
applications [33], 3D face recognition [34] is getting more and 
more attention. Hence, many researches have been exposed 
[35]. 


This test requires an implementation of the belonging condi- 
tions of skin pixels. As we apply this test, the pixel value 
is, thus, kept. Otherwise, the white pixel is returned. As 
consequence, the obtained image will only contain skin pixels. 
Obviously, the described technique relies on thresholding in 
relations to chromatic space. 

1 ) Chromatic space: We can present the color of the skin in 
various chromatic spaces like HSV, RGB, YCbCr, normalized 
RGB, etc 

In This section, we detail the space of YcbCr. In fact, the latter 
can reduce the illumination. It allows also the separation of 
the chromatic space by the values given in [38]. 

Y = 0.2999 * R + 0.587 * G + 0.114 * B (1) 

Cr = R — Y (2) 

Cb = B — Y (3) 

The values of Cb as well as that of Cr give the chromatic data. 
The skin color is used at a given interval of the thresholding 
order to define the skin areas in the image. We consider the 
thresholding described below: 

YCbCr( 1) with a threshold ((85 < Cb < 135) 

and (135 < Cr < 180)) (4) 

Our approach aims at minimizing the time devoted for research 
in a given image. For this reason, detecting the color of the 
skin is used in order to save time, and do not work on an area 
having no skin color. The performance time of the proposed 
approach is essentially relies on two important matters: the 
time of learning and that of research. 

Actually, it is somewhat difficult to minimize the former 
because it is strongly related to the number of the classified 
images in the step of learning. Nevertheless,, the latter can be 
reduced when the color of the skin is used. The detection of 
the skin, applied on image containing many faces, is presented 
in (Fig. 3). 


III. Our proposed detection approach 

Now, we will explain our proposed approach relying on the 
colors of the skin to remove areas without this feature, and 
the Gabor filter for the extraction of the texture of the image. 
However, in the classification phase, we apply the neural 
network. The combination of both Gabor Wavelet[36] and 
Neural Network is not new [37] but our improvement consists 
of incorporating skin detection before feature extraction in 
order to reduce Gabor process to areas likely to be face pixels. 

A. Skin areas segmentation 

For the detection of faces in the images, we start first by 
detecting the areas of the skin. That is to say, in this step, we 
detect the pixels representing the skin. Actually, skin detection 
is very important regarding the time of computation because 
areas of the skin can strongly present the face. Such time 
saving is so important since the skin parts size is smaller if 
compared with that of faces image. 




Fig. 3. Example of skin detection 


B. Extraction of features using Gabor filter 

In 1946, Dennis Gabor introduced the so-called Gabor filter 
[39] which is considered so popular tool of analysis. It is 
strongly linked to Gabor wavelet because the latter shows an 
orientation and frequency sinusoidal carrier. 

The convolution of an image is performed using 8 orientations 
as well as 5 spatial frequencies in order to get 40-matrix cells 
which consist of 8 columns and 5 lines. The mentioned matrix 
contains its kernel answer since the model of the frequency 
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depicts a given image according to the periodic structures 
through its decomposing by a simple periodic functions base. 
The representations in frequency and orientation of Gabor 
filter are similar to those of the human visual system. We 
consider the kernel of the Gabor filter as the product of 
a complex sinusoidal wave with a Gaussian envelope. The 
representations in orientation and frequencies, given by Gabor 
filter and those of mans visual system are almost the same. 

In this work, the Gabor filter is viewed as a result of a 
complicated sinusoidal wave having a Gaussian envelope[40]. 
An example of facial representation using Gabor wavelet is 
illustrated in (Fig. 4). 



(a) (b) 


Fig. 4. An example of facial representation using Gabor wavelet: response 
of the amplitude (a) and response phase (b) using 40 Gabor kernels (8 
orientations and 5 scales) 


C. Neural network 

It represents a method of classification applied in various 
applications. It relies on both the phase of learning and that of 
test. The former permits to search for producing automatically 
rules from the data stores of learning which involves exam- 
ples. i.e the training of the network. Nevertheless, in the test 
phase, we prove the presence of the face in the test image. We 
also distinguish non-face images from face ones. 



Fig. 5. Face detection using neural network 


We use the suggested neural network in order to classify 
the pixel of the image as being face or not. However, we 
cannot propose a standard methodology to define the best 
neural network to be used. 

The face detection execution time of a system based on the 
neural network depends on two factors: 

• The time of learning: this factor is closely related to 
the numbers of the learning images, the input vector 
components number. It depends on the characteristics 


of the learning technique which has a time not easy to 
minimize. 

• The time of research: is intended to be reduced by using 
the segmentation of region. In fact, to detect a face in an 
image, we will not scan the entire image but only skin 
regions. 

IV. Our proposed Face recognition approach 
HMMLBP 

Here, we aim at determining, from the database, the identity 
of the corresponding person whose face is called test image. 
In this phase, information about the face is viewed, by the 
HMMs, as a sequence variable in time. The flowshart of this 
approach is given by (Fig. 6). 


h 


HMM 



Decomposition into blocs 


Feature extraction 


^The use of the x 
LBP 

only in this 



Probability calculation 


Probability calculation 



* Maximum selection <- 


Recognized 
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Probability calculation 

1 

i 

i 




Database 


Fig. 6. Our proposed HMMLBP approach functioning 


A. Hidden Markov Models (HMM) [29] 

In 1975, the HMMs is first used in various fields especially 
in speech recognition. These methods show a statistic model 
decomposed by unidirectional transitions and states. 

This HMM tool is determined by this triplet A = 

(A,B, 7r) having : 

A = the state transitions probability matrix as = 

P[q t = Sj\q t -i = Si] with 1 < i < N; j < N; 0 < 
dij < 1; a ij = 1; N denotes the model states 

number; t is the time; q t shows the state of the model 
at t (a given instant) with 1 < t < T; T represents the 
observation sequence length; S denotes all the states set; 

B = {bj(k)}\ presents the observation symbols probability 
matrix with bj(k) = P[O t = Vk\qt = Sj] and 1 < j < 
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TV; 1 < k < M; M is the various observation symbols 
number; O t denotes the symbol of observation at t a given 
instant; V = {Vi . . . Vk} shows all possible symbols of 
observation set; 

tt = { 7 r^}: denotes the initial state distribution with 7 ^ = 

P[qi = Si] and 1 < i < N. 

The Hidden Markov Model have efficiently been applied 
in many fields like speech recognition, features recogni- 
tion and face recognition, etc For that reason, we use the 
approach of HMM to recognize 2D face. 

For each face image, seven facial regions (hair, forehead, 
eyebrows, eyes, nose, mouth, and chin) must be placed 
from the top to the bottom in a natural order even 
if it is taken under a small rotation. Then, to each 
region we assign a state in left-right order. The (Fig. 7) 
shows the face model states and their non-null transition 
probabilities. 


latter will be 0 if the previously-mentioned value is less 
than the current pixel. 

Through multiplying the matrix containing the two 
values 0 and 1 with the weights of LBP, powers of 
two, we will give the current pixel LBP value. Then, 
we obtain our LBP value by adding all its elements. 
(Fig. 9) shows an LBP code calculation with a threshold 
equal to six. Few years later, the LBP code calculation 
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LBP = 1*1 + 0*2 + 0*4 + 0*8 + 1*16 + 1*32 + 0*64 + 1*128 
= 1 + 16 + 32 + 128 = 177 


Fig. 9. Computing of the LBP code 
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was extended by using neighborhoods having different 
sizes. The pixel at the center is surrounded by a circle 
having R radius. The P values represent the set of points 
grouped on the circle edge. Its values are compared with 
those of the central pixel. 


Fig. 7. The recognition of face from the right side to the left using HMM 


B. Our approach HMMLBP 

1 ) Decomposition into Blocks: This step is to divide both 
the image of the face and that of test into seven different 
areas (eyebrows, nose, mouth, forehead, chin and eyes). 
As shown in (Fig. 8) , each region is given a state q. 




Hair 

Forehead 

1 1 




Eyebrows 

f . 

Eyes 

1 — T 


1 L 

Nose 

Mouth 





Chin 


Fig. 8. The decomposition of the image into blocks 


2) Features Extraction: The extraction of data consists 
in extracting relevant information from the raw data 
(image of the face, the face main areas). This phase 
is very important in every method of face recognition. 
For that reason, we apply an LBP approach for features 
image extraction. 

• Local Binary Pattern (LBP) [41] 

Computing the value of the LBP is to threshold the eight 
direct neighbors of each pixel by a determined threshold 
having as value the gray level of the current pixel. If 
the value of the neighbors is more than or the same as 
the current pixel, they will be assigned the value 1. The 


(P,R) The pixel P points neighborhood with a 

radius R. 

Obviously, some neighbors are not directed related to 


• 

• 
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Fig. 10. Symmetrical neighborhood set available in a circle 


pixels. Therefore, to estimate the gray level value of 
the neighbor, bilinear interpolation is used. In order to 
calculate the LBP existing in P pixels neighborhood, in 
a given radius R, the equation below is followed: 

p - 1 

LBPp, R = s(g p — gc ) 2 (5) 

p = 0 


With: 


• g p : the neighbor pixel gray level; 

• g c :the central pixel gray level; 

• 5():a function defined as: 


S(x) 


1 if x = 1 
0 else 


( 6 ) 


As soon as all the image pixels LBP code is computed, 
the image coded using LBP operator, is than divided into 
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small regions in order to construct each one histogram. 
At the end, we obtain a huge histogram representing the 
features of the face in an image. 


Original 

image 


Image divided Histogram of Histogram of 

into blocs each bloc characteristics 



Fig. 11. Face representation using the histogram of the LBP code 


3) Probabilities computing and the maximum selection: 
In the database, each image of the face is transformed into 
a sequence of observations. Besides, for every class of the 
40 obtained ones, a learning model is computed through 
defining the parameters of the HMM (matrix of the initial 
probabilities, matrix of the transitions probabilities and 
matrix of the emissions probabilities). 

Each Hidden Markov Model represents a different in- 
dividual. In its turn, a test image is transformed into a 
sequence of observations and an HMM is computed for 
this type of image depending on the parameters of the 
Hidden Markov Model. 

Then, we recognize the person’s face through establishing 
a match between the each learning pattern and the test 
model. 

After that, we calculate a probability for all registered 
persons models in the database. The model which has 
the highest probability corresponds to the searched indi- 
vidual’s identity. 

V. Results 

A. Face detection 

In this step, we have already presented in [42] some 
results obtained by applying the proposed methods of face 
detection on some images having different orientations, 
lighting, positions, scales, skin colors and different num- 
ber of faces. For each image, we have shown the results 
obtained using the face detection approach proposed by 
Omid Sakhi in [43]. 

This method, which we have called Gabor-NN, does not 
use skin color, but only Gabor and neural network (Gb- 
NN). Then, we have revealed some findings given by our 
proposed approach applying Gabor filter, skin detection 
and Neural network. We have called the new introduced 
approach (Sd-Gb-NN). 

If we compare the proposed method detection results with 
those of another system that does not depend on the color 
of the skin, it will be obvious that our approach is more 
efficient. 

B. Face recognition 

The application of the HMMLBP is tested on standard 
databases Yale [44], ORL [45] and Feret [46]. 


The ORF database (Fig. 12) , called AT&T, contains 40 
different individuals having 10 distinct images of faces. 
Therefore, we will obtain 400 images of the face having 
256 gray levels. Their size is 112* 92. All these images 
have a dark and homogeneous background. They were 
taken with variance in times, angles, details (with and 
without hair style, glasses, beard ) and with different 
facial expressions (smile, surprised, angry,...) and with 
roughly 20 degrees in tilt. 



Fig. 12. A subset of face images from the ORL database 

The second database, Yale (Fig. 13) , consists of the 
frontal grayscale of face images representing 15 persons 
together with 11 images of the face for each individual, 
which gives 165 images. In fact, the variations in lighting 
involve center-light, right-light and left-light. However, 
the presence and absence of glasses represent the Specta- 
cle variations. Finally, the variations in facial expressions 
include happy, wink, sad, surprise, normal and sleepy. 



Fig. 13. A subset of face images from Yale database 

The third FERET database (Fig. 14) is bigger and more 
complex than the two others since it represents variations 
in pose, illumination, facial expression and occlusions. It 
contains a face images set collected by NIST from 1993 
to 1997. Each image represents a single face. Feret is 
composed of a set of reference faces called Gallery con- 
taining contains 1,196 face images and four different sets 
of probe images. The probe, called fafb, is a set of 1,195 
images of subjects taken at the same time as the gallery 
images but with different facial expression as same as the 
gallery set. The duplicatel probe set contains 722 images 
of subjects taken between one minute and 1,031 days 
after the gallery image was taken. The duplicatell probe 
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set is a subset of 234 images of the duplicatel probe set 
taken after 18 of the gallery image. 

Finally, the fafc probe set contains 194 probe images of 
subjects with d different lighting conditions. In our paper, 
we reduce all images resolution of 768 * 512 to 56 * 46. 

IlllE 

fa fb Duplicate^) fc Duplicate(ll) 


Fig. 14. A subset of different categories of probe set from Feret database 

The three tables above represent the results obtained 
while applying our proposed approach on the three 
databases mentioned below. 

TABLE I 

Comparative results of our approach with some others applied 

ON ORL DATABASE 


Approach 

Recognition rate 

HMM and the gray levels [47] 

87% 

HMM-DCT [48] 

99.5% 

HMM- SVD [49] 

99% 

SHHMM [50] 

99.5% 

Pseudo2DHMM [47] 

95% 

2DPCA Principal Component uncertainty [51] 

97.8% 

Wavelet transform and improved 2DPCA [52] 

92% 

Combination of HMM & SVM [53] 

100% 

HMMLBP 

99.5% 


Applying our face recognition approach to the ORL 
database, it is obvious that the LBP approach is more 
efficient as its recognition rate is higher than those given 
by other techniques mentioned in table I. Actually, the 
results of our HMMLBP method are better than the 
recognition rates obtained by SVD and the gray level 
which are respectively 99% and 87%. Besides, if we 
apply SVM or DCT with the Hidden Markov Model, we 
will obtain the same results. 

TABLE II 

Comparative results of our approach with some others applied 
on Yale database 


Approach 

Recognition rate 

SVM-PCA [54] 

99.39% 

SVM-ICA [54] 

99.39% 

PCA-2DPCA [55] 

92.8% 

PCA-LBP [56] 

93% 

HMMLBP 

99.33% 


By applying our approach on Yale, the obtained recogni- 
tion rate will be 99.33%, while that given by PCA+LBP is 
93%. We notice that combining SVM with ICA and PC A 
will result in 99.39% recognition rate. The latter is better 
than that obtained while using the proposed HMMLBP 
method. 

Applying our approach to Feret database which represents 


TABLE III 

Comparison of the results of some approaches obtained on 
Feret database faces 


Approach 

Recognition rate 

GWT-PHMM [57] 

90% 

MLBP [58] 

91.6% 

Gabor features + SVM + OG-SVM [59] 

92% 

SMQT + PDF + Data fusion methods [22] 

97.78% 

HMMLBP 

95% 


a bigger and more complex database due to different 
variations in pose, illumination, facial expression and 
occlusions we get a rate equal to 95%. This results 
are better than results obtained with MLBP and GWT- 
PHMM. However, results obtained by [22] still better than 
our ones. 

VI. Conclusion 

The face recognition field has been considered as so 
interesting subject that many researchers have dealt with. 
In this paper, we have proposed a novel face recognition 
system in which a solution was presented for each step. 
For face detection, we have proposed to use skin detection 
prior to Gabor filter and neural network in order to 
minimize the execution time by limiting the face research 
to the image skin regions. Then, we have combined HMM 
with LBP tool to recognize the detected face. 

Some results are presented to validate our proposed 
approach efficiency. 

For future work, we propose, first, to apply other classi- 
fication tools such as SVM and compare them with the 
obtained results. Also, our futur work will combine the 
PCA tool to reduce feature vector dimension in order to 
imrove our results. 

We propose also to exploit the work that we published in 
[60] in order to recognize person from a video sequence. 
For 3D face recognition, as we have proposed two novel 
approaches for remeshing 3D objects [61] and [33], we 
propose to develop a uniform remeshing scheme to es- 
tablish a sampling pattern across 3D faces. 
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Energy Efficiency techniques in cloud computing 

Altaf Ur Rahman, Fiaz Gul Khan, Waqas Jadoon 


Abstract — Cloud computing gaining popularity at enormous 
rate since from its emergence. CC changed the way that computing 
services are provided. On demand platform (PaaS), infrastructure 
as a service (Iaas) and software (SaaS) as a service through 
internet. Consumer use third party services instead of building his 
own infrastructure which need up-front investment and expertise. 
Cloud computing becoming popular for unlimited computing 
ower, availability , nice pricing , on demand services and quality of 
service. For availability and computing power the service provider 
expands their resource capacity to handle user requirements. This 
expansion in resources capacity lead to high energy demand. Two 
big issues for cloud computing is energy demand and 
security/privacy requirements. In this survey we will give a review 
on the latest techniques for energy efficiency in cloud computing. 
The main focus is on software base energy efficiency techniques in 
which we will explain the workload consolidation and resource 
management in detail. 

Index Terms — cloud computing, data center, energy efficiency 
techniques. 


I. Introduction 

Cloud computing is a platform which enable individual 
users and conglomerates to use infrastructure, platform and 
software as a service through internet instead of buying 
managing and developing their own. By using cloud 
computing (CC), companies reduce their operational cost 
while increasing their operational efficiency. With the 
emergence of Internet of Things (IOT) the CC usage will 
grow, in 2003 connected devices were 6.3 billion. The figure 
of active devices per person was 0.8. This figure will grow 
from 6.3 to 7.6 in 2020 and connected device per person will 
be 6.58 [1]. IOT device will use cloud platform because of 
low processing power and limited storage. 
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Every organization and individual use CC in one or other 
form using online services. Providing and managing such a 
flexible platform is a challenging task. Provider built data 
center that consist of hundreds of thousands of servers 
continuously running to provide uninterrupted services. 

Data center are energy hungry station for continuous 
running it requires a nonstop supply of power for running and 
cooling. Data center is a resource rich platform and the 
scheduling of resources is one of the challenging task. Cloud 
refer to a data centre where all the user requirements like 
hardware and software are provided in the form of 
preconfigured resources and remotely hosted applications [2]. 
Cloud service provider objective is to maximize their profit by 
efficiently utilizing the data center resources. Consumer of a 
service aim to be best served in term of cost and quality. 
Providing QOS provider have to increase the number and 
capacity of the resources which in turn increase the 
maintenance expenditure and decrease the profit that can be 
made possible by the use of virtualization. According to [3] 
energy consumption in data center can be classified in to two 
classes computing resources and physical resources. 
Computing resources consume about 50% while physical 
resources 40% the remaining 10% by power supply and other 
miscellaneous things. 

Energy efficiency is one of the biggest issue faced by data 
center. Two percent of the total power produced in USA is 
consumed by data center. Energy demand double since 2000 
[4]. Amazon estimate that energy related cost is forty -two 
percent of its budget. Data center are not environment friendly 
and contribute two percent of co2 emission. Energy 
consumption can be reduced by efficient utilization of data 
center resources. In a survey conducted in 2010 it is find out 
that idle server cause of producing 11M ton of co2 emission 
on yearly basis [5]. Energy efficient data center are not only 
economical but also environment friendly. 

The rest of the paper consist of five section. Section I 
introduction give a brief overview of cloud computing and 
energy efficiency challenges in cloud computing. Section II is 
about the energy efficiency trends in data center. Section III is 
about the software based techniques for improving energy 
efficiency in data center. The techniques presented by 
Belogazo in 2011 is more focused on power management. We 
will focus on software side because so far both the Hardware 
and software side is considered and the techniques on 
virtualization level is not studied in detail. Section four is 
about comparative analysis of different software base 
techniques on the basis of their strength and limitations. 
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II. Energy Efficiency trends at data center level 

Data center is a collection of connected server used by an 
organization for remote processing and storage. Data center 
(DC) give flexible platform to customer by hiding platform 
dependency. User has no need of any special hardware all he 
need is thin client. Cloud service provider trying to make DC 
flexible but harder for the service provider and data center 
manager. To efficiently perform user request DC have a 
hundred and thousands of servers which should be managed 
intelligently. Servers in DC consume huge amount of energy 
which is expensive and cause co2 that is 2% of global co2 
emission [6]. Building energy efficient data center is not only 
benefited for cloud service provider but also environment 
friendly. 

Initially researcher was focusing on the power distribution 
unit and power management module. They were involved in 
improving the flow of current and avoiding multiple 
conversion of alternating and direct current. This research 
suggests the use of energy star rating IT and non IT equipment 
in data center and green DC. Cooling requirements of DC 
resources is one of the prime energy demanding entity. The 
suitable location of data center can help in minimizing energy 
consumption in this regard. Location selection is based on the 
temperature and low-cost availability of hydroelectric power 
[7]. Using these techniques, no major gain is obtained. 
Researcher shifted from focusing on designing green DC and 
trying to find the energy efficient way in the traditional DC. 
DC operator work on implementing hardware base techniques 
for energy optimization. 

Using the hardware base optimization techniques data 
center operator did not get the required gain in energy. 
According to [8] it was poor program and software design. 
This initiated the trend of writing parallel program that could 
run parallel. Latter on it was found that resource management 
play important role in energy efficiency. This initiated the 
trend of developing algorithm for energy efficient 
management of DC resources. 

III. Energy efficiency techniques at data center level 

At data center level energy efficiency actions can be 
divided into hardware base, infrastructure base, software base 
and location based. In [8] author discus the various power 
management techniques both at hardware and software level 
while in [7] author focus more on software level techniques 
for energy efficiency. Further software based techniques for 
energy optimization are classified into five group’s resource 
management, Dynamic voltage and frequency scaling, parallel 
program and workload consolidation. In this survey we will 
extend the work of [7] by adding the latest techniques in 
resource management and workload consolidation. 


Data Center Level 
EE Technique 


Hardware Base 


DVFS 


Infrastructure 

Base 


Software Base Location Base 


Resource 

Management 


Workload 

Cosolidation 


Parallel Program 


Figure 1 : Energy optimization techniques at data center level 


A. DVFS 


The process of intentionally limiting the performance of the 
processor at the time when it consuming too much power 
while the task can be accomplished processing at lower 
frequency. Voltage and frequency have a direct relationship. It 
has been observed that server consume about 70% power 
when it is idle [3] saving this energy will have great impact on 
energy reduction. DVFS can handle this problem effectively. 
DVFS is basically intended and design for energy efficiency 
in embedded system. But this can be implemented and 
consider one of the most effective energy efficiency 
techniques for computation intensive server. However, this is 
considering not good for input output intensive processes [4]. 


Methods using DVFS techniques mostly consider 
homogeneous physical machine while less interest is shown 
by researcher toward heterogeneity. In [4] Jacob Leverich 
suggest a method to power on some of the resources and 
perform work on them while the rest should be in off state to 
save energy. The difference between consolidation and DVFS 
is that consolidation is applied as a whole regardless DVFS 
which is local [9]. But Wilies lang propos that running all the 
resources will perform the work in less time and more energy 
will be saved. Energy efficiency achieved through DVFS 
depend on the type of SLA either strict SLA lead to low 
energy saving 1.11% while more effective in relax SLA 
6.69% [3]. 


DVFS can be effectively used in CPU intensive task while it 
is not suitable in memory and Input output intensive task [10]. 
A history table is maintained when a request is received it is 
compared against the table to find out whether it is CPU or 
I/O intensive. After this calcification the frequency is adjusted. 

B. RESOURCE MANAGEMENT 

Resource management is the process of selecting computer 
resources such as computing, storage, network intelligently 
and allocating against the single or set of request received to 
meet performance objective of the user [11]. Resource 
management play starring role in energy efficiency. 
Identifying the type of request assigning the best possible 
resources in term of performance and price by looking at SLA. 
Management is basically the scheduling of resource. When a 
request is received the scheduler identify an optimal resource 
allocation by looking and analysing the current state of the 
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system [12]. Scheduling mostly involve with Virtual Machine 
management. Virtual machine management deals with VM 
migration. VM migration performed for three things 
performance, load balancing and energy efficiency. But as our 
focus is energy efficiency so we will look at the energy 
efficiency aspect. 

f T 

Resource Management 


{ ■ 

Scheduling 


Virtual 

Machine 


Figure 2: Resource Management Hierarchy 

In [13] energy aware VM migration techniques is proposed 
based on firefly algorithm. This algorithm work using three 
principle (I) fireflies attracted toward each other without 
considering his sex because fireflies are unisex (II) less bright 
will travel toward brighter firefly’s attractiveness decrease as 
the distance increase (III) the brightness of the firefly is 
controlled by the setting of the objective functions to be 
improved. Firefly algorithm perform energy aware virtual 
machine migration by migrating VM from the most loaded 
active node to the least loaded active node. The decision of 
migration consists of four steps most loaded active node 
selection, VM selection from the loaded node, destination 
node where to be migrated and distance update. For selecting 
loaded node first computational energy is calculated and a list 
is generated then a node computation time is calculated and 
values is put in table. In step third attractiveness is measured 
on the base of CE value the least CE is the first in the list. In 
step four distance is calculated and a list is obtained a node is 
selected based on CE value which is closest to the calculated 
distance. Compute load on individual VM and create a list in 
descending order. Migrate the top VM in VM list to the top 
node on AI list and update the distance to reinitiate the 
process. 

To improve energy efficiency is to minimize energy intake 
of resources such as memory, CPU, network and storage. 
Suggested strategies use the approaches of utilizing limited 
resources effectively. Turning off or bringing to sleep mode 
idle resources for saving energy. Network traffic is kept 
minimum to optimize network energy. To keep network 
traffic minimum VM which have high traffic are shifted on 
same PM. Keeping network traffic minimum less number of 
network element will be in active state this will lead to save 
considerable amount of energy [14]. 

VM on pm are not utilized to the fullest of his capacity in 
[10] author propose a technique called residual resource 
fragmentation. There are different resources but we are more 
interested in processing and memory resources. When a 
resource usage is limited or blocked by some other resource is 
called resource fragmentation. In this paper author basically 
combine the fragmented resource in each single pm and 


combine them by bringing it to the least number of pm. let 
suppose we have two pm its CPU utilization is (80%, 60%) 
and (80%, 95%) if a request is made for creating a new VM of 
above 40% processing it cannot be served. But we have 85% 
processing power combinly on two pm if we combine it we 
can serve a request of creation of new VM above 80%. 

In [15] author proposed a scheme based on virtual machine 
consolidation, reducing the number of active servers by 
consolidating virtual machines dynamically on a minimum 
number of servers, and take advantage of the lacking energy 
proportionality of commonly used hardware. VM 
consolidation is a methodology to maximize resource usage 
while keeping low the energy need in a virtualized 
environment. Most of the paper does not consider network 
energy consumption and cooling energy consumption of a 
data center. Consider both the network data structure and 
cooling energy when consolidating VM. Fever racks and 
routers are utilized without compromising SLA to improve 
energy efficiency. 

C. WORKLOAD CONSOLIDATION 

Data center are physically distributed over a geographical area. 
When a request is received it is put on the nearest 
geographically located data center. This help to reduce 
network delay which in turn reduce turnaround time. Inside 
data center it is tried to fulfil the request using currently active 
PM to save energy. A large percentage of energy is consumed 
as high as 50% more than when server performing at the peak 
power when server in data center are idle [13]. This high 
percentage is because 70% of the time server are idle mean 
under loaded [3]. For saving this energy an effective way of 
choosing the right node for the request processing is needed. 
A strategy is required to put the under loaded node into low 
energy state or turnoff. When to turnoff and when to awake 
these are some of the decision that should be devised in this 
strategy. Similarly, if a new request comes and no active 
server can perform then a new server must be turn on or 
wakeup to fulfil this request. This flexibility is possible by 
virtualization technology to put more than one request on a 
single server to improve resource utilization and decrease the 
resource demand [16] [17] [18]. 

Putting more than one VM on a single pm to optimize 
energy consumption is named VM consolidation. The 
consolidation of VM on fever number of pm is online bin 
packing problem. Where pm is a bin and VM is an object and 
the process is real time so decision has to be quick and 
efficient both in performance and resource utilization 
efficiency [19]. VM are of different sizes according to the 
resource demand and bin have different capacity in term of the 
resources it has. At the time of assigning VM to pm it is tried 
to use the active server instead of turning on or waking up a 
new server. How it will select the pm from multiple on pm to 
put the request on them. One techniques are to put the request 
on the pm which have minimum free space to accommodate 
the request. This technique is used to acquire maximum 
resource utilization by leaving the large free space on other 
pm to be used for another request. However, putting a server 
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into sleep mode and waking up considered an overhead but 
this overhead is tiny [20]. Aggressive consolidation may lead 
to performance degradation as well as energy inefficient. 
Because when a pm performing at the maximum capacity its 
performance efficiency and speedup is degraded and its 
cooling requirement rise up. 

D. PARALELL PROGRAMING 

Multi core systems improve performance by the cost of 
increasing computational energy. Handling fastest processing 
unit in effective way is one of the most complex task in 
computer system. A large number of techniques are available 
for improving the energy consumption of these 
multicore/parallel system. All these techniques improve 
energy consumption [21] by compromising throughput to 
optimize performance per watt. Parallel program came under 


software optimization which is further classified as code 
optimization and runtime optimization. Parallel program deal 
with code optimization. For obtaining energy efficiency 
programmer have a good knowledge of underlying hardware. 

IV. Comparative Analysis of different Energy 
Efficiency techniques 

All techniques have some limitation and strength. There is no 
technique which have good enough up in all aspects. Mostly 
techniques are focusing on the area which is more responsible 
for energy usage like server/storage and cooling which is 
responsible for 50% and 34% power usage of the data center 
respectively. 


Table 01: Comparison of Energy Efficiency Techniques 


Technique 

General comments/Strength 

Limitation 


— Work well for computational intensive tasks. 

— Didn’t perform well for i/o intensive 
task. 


— No overhead of maintaining data center level 

— dependent on SLA not applicable for 


load information applicable to a single server. 

hard quality of service application/task. 
Strict SLA lead to low energy saving. 

DVFS 

— effectively save energy while server is idle and 



server are 70% of the time idle. 

— did not consider the priority of the tasks 
consider only the workload on CPU. 


— turning on a switch off or bringing a sleep node to an 
active state consume more energy as compared 

— quality of service compromised 


using DVFS. 

— no need of data replication because all 
the servers are active using the least 
possible energy 


— Resources are not utilized all the time 70% of the 

— accurate prediction for resource, 


time it is idle. 

resource capacity and resource time for 
which an application required a 
particular resource is not known in 


— using efficient management of resources more than 
50% power can be saved. 

advance. 

Resource 


— resources management is possible only 

Management 

— resource management is the only techniques that 
have the potential of more power saving as 

on accurate resource demand prediction 


compared to other techniques. 

— performance may be degrading depend 
on SLA 


— profit and utilization maximization of infrastructure 

— chances of SLA violation is higher 


is a service. 

— quality of service is maintained 

— data replication for data availability 
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— Avoid overloading 

— increase network traffic 



— delay involve due to network 

Workload 

consolidation 

— balancing the load for energy efficiency and 
performance. 

— turning idle node to sleep or turn off state 

— increase service provider profit by maximum 
utilization of active resources 

— turnaround time increase 

— only consider the workload doesn’t 
consider the priority of the tasks 

— overhead of maintain all the records of 
server’s workload 

— take time to bring sleep or turn off 
server into active state which increase 
chances of SLA violation. 

— using consolidation, we have to 
replicate data for availability 

Parallel 

program 

— Time for processing decrease 

— Throughput increase 

— Energy efficiency achieved by decreasing the 
processing time 

— Depend on the percentage of code in a 
program that can be parallelized. 

— less energy saving is achieved as 
compared to resource management and 
workload consolidation. 


— component idle time is minimized 

— quality of service is maintained 

— not dependent on SLA 

— programmer is responsible for code 
parallelization 


Table 02: energy efficiency effectiveness of different paper on the basis of different parameter 


Ref 

Perf 

consi 

Netwo 

rk 

Perf 

Load 

balancing 

R- 

utili 

Energy efficiency 

Exp 

Setup 

Resp 

time 

Fault 

toleranc 

e 

[ii] 

yes 

No 

yes 

yes 

10% 

Cloudsi 

m 

Decreas 

e 

yes 

[12] 

yes 

yes 

yes 

yes 

NA 

NA 

Stable 

NA 






1.11 kwh less thean 
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[13] 

yes 

yes 

yes 

yes 

des-2-n and 12.78 
from ses-1 

CloudSi 

m 

Stable 

Yes 

[15] 

yes 

yes 

yes 

yes 

25% 

NA 

Decreas 

e 

NA 

[16] 

yes 

No 

yes 

yes 

20.8% static 
22% dynamic VM 
load 

Custom 

Built 

Simulato 

r 

NA 

Yes 

s 

[17] 

yes 

No 

No 

yes 

20% 

Cloudsi 

m 

Stable 

NA 


V. Conclusion 

In this paper we discuss four major energy efficiency 
techniques DVFS, resource management, work load 
consolidation and parallel programing. These four are 
software base energy efficiency techniques we tried to include 
the best and latest techniques for energy efficiency in these 
four categories. In future we will extend this work by deeply 
studding and analysing the resource management and 
workload consolidation techniques in detail. Most of the 
current research based on these techniques. The margin for 
energy efficiency is high in these techniques as compared to 
other techniques 
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Service Level Agreement in Cloud Computing: A Survey 

Usman Wazir, Fiaz Gul Khan, Sajid Shah 


Abstract — Cloud computing provides distributed resources to 
the users globally. Cloud computing contains a scalable 
architecture which provides on-demand services to the 
organizations in different domains. However, there are multiple 
challenges exists in the cloud services. Different techniques has 
been proposed for different kind of challenges exists in the cloud 
services. This paper reviews the different models proposed for 
SLA in cloud computing, to overcome on the challenges exists in 
SLA. Challenges related to Performance, Customer Level 
Satisfaction, Security, Profit and SLA Violation. We discuss SLA 
architecture in cloud computing. Then we discuss existing models 
proposed for SLA in different cloud service models like SaaS, 
PaaS and IaaS. In next section, we discuss the advantages and 
limitations of current models with the help of tables. In the last 
section, we summarize and provide conclusion. 

Index Terms — Service Level Agreement (SLA), Cloud 
Computing. 

I. INTRODUCTION 

C LOUD computing, is a source for providing an elastic 
resources. It is an on-demand computing that gives shared 
resources or applications to the consumer of the cloud [2] . 
Cloud is an elastic source of applications or resources. Cloud 
environment attracted many companies to implement cloud 
environment and make it available for the users to use its 
elastic availability of applications. A large number of 
application have been migrated to cloud platform. 

Virtual Machine (VM) technology is the core technology in 
cloud which enable a physical machine to be split down into 
several VMs. Through virtualization in cloud, operational cost 
minimized through server consolidation and better use of 
computational resources [1]. 

Through advancements in computing world it is nearly 
possible that computing will become the most necessary utility 
of daily life. Cloud computing provides elastic applications to 
the users and to maintain customer-driven service 
management and computational risk management to assist 
Service Level Agreement (SLA). 
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computers. Various companies provides the cloud 
environment such as Amazon, Google, HP, IBM, Microsoft, 
and Sun which aims to provide redundancy and reliability in 
case of failures. Enterprise service consumers require faster 
time in case of global operations which is crucial for the cloud 
to maintain and distribute the workload among different 
clouds at the same time. To establish this kind of computing 
platform, from multiple domains dynamically interconnecting 
and provisioning of cloud resources leads to different 
challenges either within enterprise or across enterprise [4] . 
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Fig: 1. SLA in Cloud Environment 


Due to the large number of market participants and vast types 
of services, market of cloud services are frequently suffer 
from low liquidity, e.g., The expectation of purchasing or 
selling of services, so disadvantaging repelling potential 
customers and new suppliers [5]. Furthermore, there are no 
general standard exists for service level agreement (SLA) used 
in market places. Different SLAs have same meaning but their 
syntax is different accordingly to the requirements of market 
users. 
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SLAs in cloud consists of different specification regarding 
security, pricing, performance, customer level satisfaction 
(CLS) and SLA violation. Before signing SLA make sure that 
the cloud platform you are going to select for your business is 
providing all the requirements which you needed for your 
business. In Section 2, we will discuss SLA regarding issues 
in cloud and key points which is necessary in SLAs in this 
survey [3]. In Section 3, Discussion part will be discussed. 
And in final part Conclusion and Abstract will be added at the 
end of the paper. 

The above is a cloud architecture in Fig 1, which shows the 
overall procedure of users, broker, SLA Admission, Virtual 
machines and Physical machines. In this paper we differentiate 
the SLA models in cloud environment on the basis of 
parameters like Performance, CLS, Pricing, Security and SLA 
Violation in cloud service models i.e. SaaS, Paas and IaaS. 


II. SLA in Cloud Computing 

Cloud computing consists of service models i.e. SaaS, Paas 
and IaaS. 



We will differentiate each model on the basis of SaaS, PaaS 
and IaaS. Furthermore we will expand this tree like what type 
of parameters provided by each model. 

A. SLA in SaaS: 

We will discuss models of SLA in SaaS platform of cloud 
computing and analyze each model on the basis of parameters 
i.e. Performance, CLS, Pricing, Security and SLA Violation. 

SLA in SaaS 


Performance 

Customer 

Level 
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Security 

SLA 

(Response 

time) 

Satisfaction 

(CLS) 



viol ation 


Trying to maximize CLS, in this [6] paper, the author present 
scheduling and admission control algorithm. This algorithm is 
used to increase CLS and also to increase income for the SaaS 
resources providers. The proposed algorithm performed 
different tasks in different scenarios. The results shows 
through simulation of that algorithm in different scenarios 
shows that it performed well trying to achieve CLS. Through 
simulation results display that proposed algorithm ProfPD 
saves upto 40% price of Virtual Machine (VM). In 
comparison with other presented algorithms differentiated 
through different QoS parameters ProfPD provide high profit 
to SaaS provider. There are other two algorithms also 


presented named ProfminVM and ProfRS. These algorithms 
are proposed in case of when user required a quick response 
time. In this case ProfminVM and ProfRS provides much 
better results than other algorithms. 

In [14] this paper a model of computerized negotiation 
framework is proposed which provides automatic negotiation 
facility. This negotiation framework provide decent bargaining 
of SLAs. This decent bargaining is established among 
different SaaS brokers and different companies which aim to 
achieve their target goals. To achieve CLS and to increase 
profit they provide technical strategies and individual choice 
making heuristics mechanism that remember different trade- 
off parameters, different constraints regarding marketplace, 
time and other different QoS parameters. The facts used in 
negotiation heuristics framework is derived through 
experimental studies which was taken from an actual cloud 
enterprise. Their proposed algorithm display a limit on cost 
and also increase CLS in comparison with currently used 
heuristics. The selection of cloud resource provider is done 
though broker. Selection of a good provider and 
communication overhead takes place when parallel 
negotiation is done. 

In [15] there are two policies are discussed which are used to 
handle the SLAs regarding its relationship with the goals of 
commercial enterprise and maximizing revenue. To set up the 
classification of customers is according to the relation between 
user and provider and measure the other QoS parameters that 
consumer wants to purchase a low QoS contract or greater 
QoS contract by paying high price. It is not necessary to 
consider or accept the policies as a complete set. There are 
some drawbacks involve in accepted policies. 

In an untrusted cloud for dynamic groups, in [20] propose a 
scheme of secure data sharing known as Mona. In group 
without revealing identity privacy to the cloud a client is able 
to share data with others through Mona. Moreover, Mona 
supports efficient new client joining and client revocation. 
More especially, before participation a new clients in the 
cloud can directly decrypt a stored files and through a list of 
public revocation they achieved efficient user revocation with 
no updating of private keys for other clients. Additionally, the 
encryption computation price and the overhead of storage are 
constant. The large analysis display that their presented 
method satisfies the desired guarantees efficiency and security 
requirements very well. 

B. SLA in IaaS: 

We will discuss models of SLA in IaaS platform of cloud 
computing and analyze each model on the basis of parameters 
i.e. Performance, CLS, Pricing, Security and SLA Violation. 

SLA in IaaS 
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A cloud service provider have argue the profit optimizing 
dilemma in cloud system and they have an agreement among 
short term (decreasing the incoming jobs rejection) and long 
term (maintain SLAs and QoS) profitability. They present 
SLA aware admission control algorithm for cloud system 
which don’t affect the cloud service provider’s short term 
targets and increase the profits of long term. The profit 
enhance rate is among 80% and 400%. And they also compare 
his model with ‘M/M/m/m+r’ queuing model of fixed size [7]. 
In [10] there is a system called Learning Automata (LA) is 
introduced which deals with different QoS parameters in IaaS 
cloud platform. Learning Automata is very useful in dealing 
with computational tasks. LA based system ensures that CSP 
meets with all the requirements that is mentioned in the SLA 
agreement. Through various experiments QoS metrics is 
established. Through LA system that is used to select 
resources from the cloud in order to achieve QoS parameter is 
done through parallel execution speedup, job priority and 
response time. 

In [23] they proposed a novel architecture called DeSVi, 
which is used to detect the SLA violation in cloud platform. 
The elements involved in their architecture are automated VM 
deployer, which is responsible for telling that execution of 
user application is done and LoM2HiS framework, which is 
used to track packages execution and convert low level SLAs 
in to high level. 

In [24] they try to find out the two different user’s hidden 
requirements, which are not known to the CSP, in order to 
achieve CSL. So new method is used to estimate their 
requirements. Through this technique Cloud provider try to 
minimize the SLAs violation and to increase customers trust 
level. They carried out an experiment that during significant 
situation when resources are allocated, the estimation of 
resources can help the CSP to known about users requirements 
and analyze that which user to be served and which can be 
rejected. This will maximize CSL and CSP profit. This 
method is introduced to minimize SLAs violation. 

C. SLA in SaaS, PaaS and IaaS: 

In this paragraph, we will discuss that models which provides 
services of SaaS, PaaS and IaaS in cloud computing and 
analyze each model on the basis of parameters i.e. 
Performance, CLS, Pricing, Security and SLA Violation. 


SLA in SaaS, 
PaaS & IaaS 
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To overcome the issue of resource provisioning in datacenter 
that execute multiple form of applications workload, specially 
transactional and non-interactive applications. Here in [8] 
scheduling and admission control method is presented not 
only to maximize the profit and resource utilization, but also 


guaranteed the customer level QoS satisfaction are achieved at 
SLA. From their experimental result, for better resource 
utilization and provisioning of datacenters they suggest that 
this method is very helpful to aware from other multiple kinds 
of SLA along with mix of workload and appropriate penalties. 
The presented method decreases the violations of SLA and 
provides significant improvement over the consolidation of 
static server. 

In [9] a new method is proposed called SLA aware service 
(SLAaaS) in Cloud computing. In this method QoS is merged 
with SLA and his goal is to decrease cost and to improve 
performance. CSLA language is introduced to define the SLA 
figures efficiently and addressing SLA violation. It ensures the 
best SLA guarantee and defined SLO (service level objective) 
as well as the trade off in SLA, priority of resources and 
weight among them. For this purpose an online algorithm is 
used in order to achieve the SLOs and to guarantee the service 
level agreements. They did not focus on other things like 
throughput of service etc. they did not target the other QoS 
parameters like privacy in cloud service and security 
guarantees. 

In [11] architecture is proposed that enables Quality of Service 
(QoS) aware configurations, which enables the system with 
network federation integration to different types of cloud 
middle wares. It achieves the Quality of Service through 
virtualization on cloud base software define networks (SDN). 
Interface is proposed on the basis of SLA, allocation of 
network resources by the highest layer on the basis of QoS 
parameters that is enforced by the bottom layer. That is refer 
to the model which is “receive which you pay” e.g. allocation 
of memory and CPU through virtual hosts. 

In unreliable resources context like spot instances to balancing 
the monetary cost versus reliability in [12] effective bid 
decision making strategy is proposed. The proposed strategy is 
known as AMAZING. The proposed strategy exploits the 
intelligence between different Spot Instances about state 
transition to facilitate decision making throughout the job’s 
computation course. They formulate the problem of 
optimizing decision making as a Constrained Markov 
Decision Process (CMDP). Before the job’s computation is 
completed for each instance hour optimal bid decision is 
applied after solving the AMAZING, CMDP. From their 
analysis results it is confirm that AMAZING give better 
performance as compare to early works in both terms like 
monetary cost and execution time. 

In competitive cloud market the service provider should select 
an optimal penalty policy to achieve the maximum profits. In 
[13] they first survey the methods for penalties calculations of 
cloud service providers. According to the survey, they present 
for cloud service providers a correspondent penalty base profit 
maximization mechanism and a competitive penalty approach. 
Based on this approach, during the game procedure to achieve 
the maximum expected profit every cloud service provider 
would select the better fit penalty policy. 

In [16] they proposed an approach called negotiation 
approach, which is used in order to achieve time utilization 
and to increase the profit of cloud providers. Through this 
approach the provider capture the consumer’s choices and 
strictly managed the deadline restrictions, which leads to 
increase the reliability on provider. This approach gives 
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negotiation model for both parties in order to discuss their 
SLA efficiently. 

In [17] they proposed a multi problem negotiation model. This 
model is used to facilitate the following figures. 1) Tradeoff 
among price and time- slot utilities and 2) PTNs (Price and 
Time-slot Negotiation) among cloud agents. The agent make 
multiple proposals in the negotiation round which leads to 
establish similar aggregate utilities, just individual time-slot 
utility and cost are different. It provide novel tradeoff 
algorithm, known as the “burst mode”, which is used for 
resource reservation, resource allocation in different time slots 
proposal. 

In [18] they proposed an algorithm derived from different 
machine learning algorithms. This algorithm is used to find 
the semantically equal SLA elements from different SLAs. 
This technique provide to utilize the resources for the 
consumer of the cloud by allowing automatic selection of the 
required best services offers. A framework is developed for 
automated selection of SLA, its creation and management and 
simulation based evaluation is performed. Simulation based 
results provided which tells that their technique is efficient. 

In [19] the authors present the analysis of access control 
requirement with detail for cloud networking and significant 
gaps discovered, those are not fulfill through traditional access 
control mechanism. To identifying the access control 
requirements of cloud they recommend access control model 
but the authentication mechanism and risk engine are 
implemented before a proposed model will be evaluated and 
implemented. They also implement risk engine with its 
components which deal with dynamic behaviors and also 
implement authentication approach which deal with large 
space and high time complexity. 

The negotiation method and description mechanism of privacy 
information is presented in [21]. First, by description logic 
base they explain property of privacy with Privacy 
Negotiation Language (PNL). Second, by pre-negotiation 
among service provider and client they get privacy attribute 


sequence. Third, to satisfy both parties they achieve privacy 
policy based on exchanging privacy disclosure assertion. In 
this paper for client data protection they offers a theory basis 
and implementation technique in cloud system. At the end, 
they present an algorithm for privacy policy negotiation. 

To facilitate the selection of competent and trustworthy 
service provider in [22] the authors recommend a new 
framework which is known as SelCSP. The trustworthiness is 
estimated by proposed framework in term of reputation 
feedbacks, context- specific and dynamic trust. The service 
provider competence is also computed in kind of SLAs 
transparency by framework. To estimate the risk of 
interaction the above two entities are combined. In this 
situation of interaction such estimate makes able a client to 
select a services provider. 

To minimize service level agreement violations and cost of 
infrastructure for software as a service provider in [25] three 
cost driver resource allocation algorithms are presented. The 
presented three algorithms are intended to make sure that 
software as service providers capable to mapping client 
request for infrastructure level parameters, handling the virtual 
machine heterogeneities and manage the customer’s dynamic 
changes. The proposed algorithms for both software as a 
service provider’s and clients perspective consider different 
QoS parameters like a penalty rate, arrival rate and service 
initiated time. The simulation result shows that their proposed 
algorithms minimized the cost of the software as a service 
provider and rate of service level agreement violations in 
cloud resource sharing environment. 

III. Analysis of Models 

In this section we will analyze all the models proposed for 
SLA in cloud computing. We will analyze the different 
parameters which are addressed by the different models in 
order to ensure the CSL, Security, Performance, SLA violation 
and to increase the provider’s profit. 


TABLE I 

Analyzing Models Which Provides SaaS Platform 


Paper 

Ref. 

Propose model 

SLO 

(service level 
objectives) 

Profit 

Customer Level 
Satisfaction 

Performance 

Security 

SLA violation 

[6] 

Admission control 
and scheduling 
algorithms 

Profit, Cost, 
Customer Level 
Satisfaction 

Maximize, 
save 40% 

Maximize 

Quick response time 

Nil 

Nil 

[14] 

Novel automated 

negotiation 

framework 

SLA negotiation, 
resource allocation; 

50% 

increased 

profit 

Improve 60% 

Availability Improve 
60%, Resource 
allocation 

Nil 

Nil 

[15] 

Two sets of policies: 
Revenue 
Maximization 
Or Classification of 
clients. 

Cost, Profit, 
Relation between 
client and provider 

Less 

Maximize 

Cloud provider is to 
maximize its 
economic profit and 
Client Classification. 

Nil 

Nil 

[20] 

Mona scheme 

Security, data 
sharing, privacy- 
preserving, access 
control, dynamic 
groups 

High 

Guarantees 
efficiency as 
well. 

Nil 

High 

Nil 
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TABLE II 

Analyzing Models Which Provides IaaS Platform 


Paper 

Ref. 

Propose model 

SLO 

(service level 
objectives) 

Profit 

Customer Level 
Satisfaction 

Performance 

Security 

SLA violation 

[7] 

SLA aware 
admission control 
algorithm 

QoS, SLA base 
admission control, 
profit 

Maximize, 
ranges 
between 
80% and 
400%. 

Maximize 

Keeping the response 
delays 

Nil 

Nil 

[10] 

Learning Automata 
(LA) -based QoS 
(LAQ) framework 

QoS parameters 

Nil 

Better response 
time 

Parallel execution 
speed up, and job 
priority 

Nil 

Nil 

[23] 

LoM2HiS 
framework for 
resource monitoring 

Obligations, 
pricing, and 
penalties of 
violations. 

No cost for 
missing SLA 
violation 
detection 

Nil 

Nil 

Nil 

Monitor and detect 
SAL violation 

[24] 

New proactive 
resource allocation 
approach 

Satisfaction level, 
SLA violation 

Gain 

profitability 

Improve users’ 

satisfaction 

level 

Maintain response 
time 

Nil 

Decreasing 
impact of SLA 
violations. 


TABLE III 

Analyzing Models Which Provides SaaS, PaaS and IaaS Platform 


Paper 

Ref. 

Propose model 

SLO 

(service level 
objectives) 

Profit 

Customer Level 
Satisfaction 

Performance 

Security 

SLA violation 

[8] 

Admission control 
and scheduling 
method 

Virtual machine 
migration, 
Resource 
management and 
user require QoS. 

Maximize 

Maximize 

Maximize 

Nil 

Resource 
utilization, 
decreases the SLA 
violations 

[9] 

SLAaaS 

&CSLA 

Language 

Addressing SLA 
violations, cost, 
dependability, 
response time 

Maximize 

Higher-level e- 

commerce 

service 

Quick Response 
Time 

Nil 

SLA violation 

[11] 

Architecture that 
enables Quality of 
Service (QoS)aware 
configurations 

Quality of Service 
through 

virtualization on 
cloud base SDN. 

High Profit 

Nil 

Better response time 

Nil 

SLA violations 

[12] 

Effective bid 
decision making 
strategy known as 
AMAZING 

Bidding 
Strategy, 
user reliability, 
cost 

Maximize 

Nil 

Minimize 

job completion time 

Nil 

Nil 

[13] 

Competitive penalty 
model and penalty 
based profit 
maximization 
algorithm for cloud 
providers. 

SLA, availability, 
penalty degree 

Maximize 

profit, 

Maximize 

High availability 

Nil 

Competitive penalty 
model 

[16] 

Time dependent 
negotiation model 

Reliability 

More profit 
for 

providers. 

Nil 

Reliable & 
Availability 

Nil 

Nil 

[17] 

Novel tradeoff 
algorithm, known as 
the “burst mode” 
proposal, 

Price and time slot 

Maximize 

Nil 

Resource reservation, 
Resource allocation, 
different time slots. 

Nil 

Nil 

[18] 

Approach for 
automatic 
SLA matching 
and 

Provider selection. 

SLA mapping, 
SLA matching, 
provider selection, 
machine learning, 
autonomic 
computing, 
electronic markets 

Maximize 

Maximize 

Nil 

Nil 

Automatic SLA 
management 
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[19] 

Access control 
model 

Security 

High 

Maximize 

Nil 

Access 

control 

high 

security 

Scalable 

[21] 

Algorithm for 
privacy policy 
negotiation 

Description logic 

Privacy, 

property, 

Privacy policy, 
Privacy preference 

Nil 

Nil 

Nil 

high 

Nil 

[22] 

SelCSP framework 

Service provider, 
trust, relational risk, 
performance risk, 
service level 
agreement, control, 
transparency 

Nil 

Nil 

Nil 

Security 

guarantees 

Minimize SLA 
violation 

[25] 

Profmin, VMmin 

AvaiSpace 

algorithm 

SLA; Resource 
Allocation; 
Scheduling. QoS 
such as response 
time, and initiation 
time. 

Nil 

Improve 

Maintain response 
time 

Nil 

Minimize SLA 
violation 


IV. Conclusion 

In this paper, we surveyed various models used for SLA in 
cloud computing environment. Some of the models can 
provide high level security measures for consumer’s data, 
while some of the models provide penalty on SLA violation. 
Some of them increases user’s trust level while some of them 
maximize their performance level as compared with other 
models. To establish SLA between consumer and cloud 
service provider, we need to understand the role of cloud 
service provider either the CSP can provide all the required 
services according to the user’s choice? Because User 
expecting from cloud service provider to provide all the 
necessary services for their data. For every CSP, it is very 
difficult to provide security for user’s data to ensure 
confidentiality, integrity, reliability, availability and privacy. 
In this survey, we discuss some of SLA parameters for 
consumers that must consider these parameters before signing 
SLA in cloud platform. 
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Blind watermarking algorithm for 3D 
multiresolution meshes based on spiral scanning 

method 
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Abstract — 3D mesh is a new data type appeared in the last 
decades. Since its emergence, it has been used in several areas 
which raise major security problems. As a solution, we propose 
a blind watermarking algorithm for 3D meshes. For doing spiral 
scanning method decomposes the mesh into GOTs (a Group 
Of Triangles). At each time, only one GOT will be uploaded 
into memory. It undergoes a wavelet transform to generate 
vector of wavelet coefficients. This latter undergoes modulation 
then embedding steps using data coded with BCH code. Once 
watermarked, the next GOT will be uploaded. This process 
stopped when the entire mesh is watermarked. Experimental 
tests show that the quality of meshes is kept despite the high 
insertion rate and that memory consumption is reduced. As 
for robustness, our algorithm overcomes the following attacks: 
translation, rotation, smoothing, uniform scaling, coordinate 
quantization, noise addition, simplification and compression. 

Index Terms — Digital watermarking, 3D meshes, Multi- 
resolution, Wavelet transform, Spiral scanning, Attacks, Com- 
pression. 


I. Introduction 

D URING the last decade, the flow of 3D objects is 
increasingly used everywhere. This is due to the quality 
and accuracy of 3D models that makes them increasingly 
indistinguishable from concrete objects. The strengthening of 
computer graphics and acquisition techniques has generated a 
variety of areas benefiting from this new data category. Indeed, 
3D meshes become ubiquitous in industrial applications such 
as movies production where 3D models are used in animated 
movies and live action feature films [1]. Towards the 2000s, 
the new 3D technology led the gaming industry to move 
away from 2D and pseudo 3D games (simulating 3D with 2D 
graphics projections), and to use 3D game engines leading to 
an abundant use of 3D models. The mechanical environment 
is another example as it benefit from 3D objects to perform 
virtual prototypical models. Let us add the use of this data 
type in architecture ( to see the final impact of such a 
project) and medical visualization ( abundant use of MRI, 
ultrasonographicand radiography) domains. In particular, the 
idea of multiresolution plays a growing part in the field of 

1. Sayahi is with REGIM-Lab.: REsearch Groups in Intelligent 
Machines, University of Sfax, ENIS, Sfax,3038 Tunisia, e-mail: 
phd . ikb el. s ay ahi @ ieee . org . 

A. Elkefi is with REGIM-Lab.: REsearch Groups in Intelligent 
Machines, University of Sfax, ENIS, Sfax, 3038 Tunisia, e-mail: 
akram.elkefi.tn@ieee.org. 

C. Ben Amar is with REGIM-Lab.: REsearch Groups in Intelli- 
gent Machines, University of Sfax, ENIS, Sfax, 3038 Tunisia, e-mail: 
chokri .benamar @ ieee . org . 


geometric modeling and especially scientific visualization. The 
initial objective, which remains the principal today, is the 
simplification of complex and dense mesh, particularly those 
derived from subdivision techniques or acquisition by 3D 
scanner. 

On the other side, the information technology revolution, that 
has affected mainly the telecommunications and networks, 
has made its own way. The result being is the emergence 
of high speed broadband networks allowing the storage and 
the transfer of digital documents through remote multimedia 
databases. The 3D models are an example of these documents 
shared via the net. Sharing 3D meshes between remote users 
has spawned a huge security problem especially that digital 
copying does not entail any loss of quality. In addition, in 
contrast to acts of counterfeiting of analog works, the digital 
reproduction costs are negligible and counterfeiters can act 
anonymously without leaving a trace of their passage. All these 
problems leads that legal protection is no longer sufficient to 
ensure alone the peaceful management of works transmitted 
to the public. Hence, the need to use other techniques to 
strengthen existing legal protections seems necessary. Digital 
watermarking is then announced as a new technique that aims 
to limit these ’’digital abuse” and to preserve the copyrights 
[ 2 ] 

In this context, many works have recently appeared to secure 
meshes ’copyright. The main topic of 3D watermarking is 
the choice of the embedding strategy (Where can we embed 
information? and how can we do it?). To answer these two 
questions, many 3D watermarking algorithms were published. 
Sharvari et al in [3] and Chao-Hung Lin et al in [4] chose to 
insert information in spatial domain by modifying geometric 
information. Zhiyong et al [5], combined spectral and spatial 
domain during embedding to improve robustness and invisi- 
bility. Multi-resolution domain is also present in the proposed 
watermarking algorithms. To transform host mesh from spatial 
to multi-resolution domain, there is an application of a wavelet 
transform. Works published in [6], [7] and [8] are examples 
that embed information into wavelet coefficients. 
Unfortunately, all this diversity of methods does not deny that 
3D watermarking is ’’still far from the level of maturity of 
other watermarking technologies for audio, video or image 
[2]”. In fact, a perfect solution for 3D watermarking has not 
yet been proposed. In this context, we propose a new blind 
watermarking algorithm for 3D multi-resolution meshes. Our 
goal is no longer to have a good compromise between insertion 
rate, robustness and invisibility only, but also to minimize the 
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hardware resources used during the execution of our algorithm. 
This may facilitate the implementation of our prototype as well 
as the handling meshes with high definition. 

II. Research status 

Since the appearance of the first watermarking algorithm 
for 3D meshes by Ohbushi et al in [9], many works were 
proposed in order make improvement in this research axis. 
The general structure of a watermarking scheme is always the 
same. In fact, each 3D watermarking scheme must necessarily 
go through the following three steps: the insertion of data, 
dissemination of the marked document in a noisy environment 
and extracting the signature from the watermarked and often 
attacked mesh [10]. Unfortunately, it is not obvious to correctly 
extract embedded information. In fact, attacks applied may 
alter and even destroy the already inserted data. 

Therefore, 3D watermarking algorithm targeting 3D meshes 
have continued to appear attempting to make improvements 
which create a diversification in terms of adopted areas and 
used techniques during embedding information into meshes. In 
order to classify these works, we choose to adopt two different 
criteria: the embedding domain and the type of information to 
change during insertion. 

The first embedding domain used was the spatial one. The idea 
is to work directly on the mesh without any transformation. 
At this level, two types of information can be targeted by 
insertion. The first is the geometric information. In fact, to 
insert data into the mesh, Xiao et al [11] and Jen-Tse et 
al [12] used the Cartesian coordinates of each vertex. The 
watermarking algorithm, too, published by Hintandra et al in 
[13] worked also on the spatial domain and more precisely 
on geometric information. The proposed idea was to move a 
selected set of vertices from their positions depending on the 
data to be inserted. This selection is based on the principle 
of IEEE754 floating point representation. The second target is 
the topological information as in [5] and [4]. Insertion, in this 
case, aims at changing connections between vertices. 
Although working in spatial domain is slightly complex, given 
that no transformation will be applied on the mesh, results 
show a failure in visibility criterion. Indeed, working directly 
on geometrical and topological information has a considerable 
influence on the deterioration of the mesh quality. 

Due to improvement attempts, other insertion domains have 
been adopted, such as frequency area. The idea is to ap- 
ply transformations to the mesh, before embedding step, 
to present it in the frequency domain using the so-called 
’’frequency coefficients”. Many tools allow ensuring this new 
representation. Xiangjiu et al in [14] chose to apply the mesh 
Radial Basis Functions before embedding. Insertion, in this 
case, takes place by changing the low frequency coefficients. 
Manifold harmonics is a tool used by Jinrong et al in [15] to 
work in the frequency domain. Applying Manifold Harmonics 
Transformation allows the generation of spectral coefficients. 
These latter will be modified according to the message to be 
inserted. 

As was the case for image and video [16]— [18], multi- 
resolution domain is no longer excluded from watermarking 


works for 3D meshes. Works in [19] and [3] are examples 
of works operating in this domain. The used tool is the 
wavelet transform allowing the generation of multi-resolution 
coefficients called ’’wavelet coefficients” which are targeted 
by embedding. Approach Ouled Zaid et al in [7] targeted 
these coefficients during insertion. The modification is based 
on QIM algorithm to improve the amount of information to 
be inserted. 

As a conclusion, we can say that there is a diversity of used 
methods and tools in watermarking algorithms targeting 3D 
meshes. Unfortunately, despite the efforts to innovate and 
improve in this field, the security problem of 3D mesh is 
always posed. As shown in table I, a perfect solution has not 
yet been proposed. ST reminds the similarity transformation 
attacks including translation, rotation and uniform scaling. 
Results presented in I reveal a failure in robustness criterion. 
Indeed, there is no solution that resists to all attacks targeting 
3D meshes. The compression attack, frequently applied to 
3D meshes, is absent in results published in recent works. 
This suggests that compression presents a real challenge for 
3D watermarking. In addition, the watermark used during the 
insertion is relatively short. This limited the number of bits 
influence on the amount of information to be hidden in the 
mesh. 

The failure already unveiled amounts to two reasons. The first 
is the complexity of handling 3D meshes compared with other 
types of data such as sound, image or video. The complexity 
of representing 3D meshes and the sensitivity of these data 
allowed concluding that the manipulation of the 3D mesh 
is a difficult task. This return to the following 3D mesh 
characteristics: 

• The absence of a global parameterization, 

• The arbitrary topology, 

• The irregular connectivity, 

• The non-uniformity of sampling. 

The second reason is the difficulty of finding a good compro- 
mise between robustness, visibility and insertion rate which 
can be justified by the strong link between these 3 criteria. In 
fact, the increase of the amount of data to be inserted causes 
either a serious deterioration of the mesh quality or reduces 
the level of robustness. 

III. Backgrounds 
A. Multi-resolution Analysis 

In the 3D world, an object can be defined by a cloud 
of points having 3 coordinates x, y and z in the Cartesian 
coordinate system. The surface of a 3D object is presented 
by a grid. This latter is partitioned into a set of polygonal 
elements. Polygons composing the mesh can be either 
triangles, or quadrangles or any other polygons. Especially, 
triangular meshes are used in this work. 

This category is described using a set of triangles. The points 
of the triangles are called vertices and they are interconnected 
by the edge forming the triangles. The vertices refer to the 
geometric information of the grid, and edges refer to the 
topological information. 

A mesh can be mono or multi-resolution. In the case of 
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TABLE I 

Insertion rate, Visibility and Robustness in recent 3D watermarking works. 


Work 

Blind 

Insertion rate 

Invisibility 

S. T 

Noise 

Smoothing 

Quantization 

Simplification 

[11] 

yes 

- 

- 

+ 

- 

- 

- 

- 

[3] 

No 

+ 

- 

+ 

+ 

+ 

- 

- 

[5] 

Yes 

- 

+ 

+ 

- 

- 

- 

+ 

[4] 

Yes 

+ 

+ 

- 

- 

- 

- 

- 

[19] 

Yes 

+ 

+ 

+ 

+ 

- 

+ 

- 

[12] 

Yes 


+ 

- 

- 

- 

- 

- 

[14] 

Yes 



- 

- 

+ 

+ 

- 

[15] 

No 

- 

- 

+ 

+ 

- 

+ 

- 

[13] 

No 

- 

- 

- 

- 

- 

- 

- 

[7] 

Yes 

+ 

- 

+ 

+ 

- 

+ 

+ 



Fig. 1. Multi-resolution Analysis. 

multi-resolution, a 3D object is represented at different levels 
of details. It is composed of a coarse mesh and a set of 
information which refine the coarse mesh to its finest levels 
version (see figure 1). Multi-resolution meshes are stored in 
files with a .dat extension. So far, no watermarking work has 
targeted multi-resolution meshes stored in DAT format files. 
This is due to the sensitivity of handling this category of data. 
In fact, the treatments applied to the mesh can easily alter its 
multi-resolution appearance. Our goal is then to watermark 
this kind of meshes without influencing either its quality or 
its multi-resolution appearance. 


B. Digital Watermarking 

By definition, digital watermarking [20] is a technique that 
involves inserting a watermark, in the form of a message or 
logo, in a digital document known as ’’host document.” The 
appearance of this technique was the result of an intense need 
to secure the shared flow of digital data. Once inserted, it 
must be possible to retrieve the watermark after the transfer 
of this document. Inserted data may contain an identification 
number to implement a system of copyright or a description 
of the content in order to indexing and it must be indelible 
[21]. That is to say that once inserted, it must be impossible 
to remove watermark without using the key or the insertion 
method. On the other hand, inserted data must be robust to any 


attack targeting the host document. The 3 steps composing a 
3D watermarking algorithm are the following: 

• Insertion: This step uses the host mesh and the watermark 
to be inserted. Depending on the application which is 
dedicated to the 3D watermarking, the insertion can be 
made either in spatial domain (directly on the mesh) or 
in the transformed domain ( frequency, spectral or multi- 
resolution domain) [22]. Inserting data in the mesh can be 
made following an additive (add information to the mesh) 
or a substitute scheme (substitute some components of the 
mesh). 

• Dissemination: Once the mesh is watermarked, it may 
undergo different treatments. The most frequent being: 
storing in a database, transferring in a computer network 
or even a set of treatments to specific improvements. 
These can give opportunity to attacks to change the 
watermarked mesh and consequently alter the data being 
inserted. 

• Extraction: Having received the watermarked mesh, the 
extraction step begins. It involves extracting data from 
the watermarked and usually attacked mesh. In the case 
where the extraction uses only the watermarked mesh, 
the watermarking algorithm is called ’’blind”. 

C. Wavelet Transform 

The main idea of the multi-resolution analysis is to decom- 
pose a mesh Mi in two sets: a low resolution mesh 
grosser and a set of details A_i (see figure 2): the analysis 
phase. All these details and meshes of different resolution level 
are then used to reconstruct the original mesh: synthesis phase. 
Using the formalism of multiresolution analysis, we can write 

Mj = Mj_i 0 Dj — i (1) 

Here Dj- \ is all the details necessary to rebuild the mesh, 
Mj is the higher resolution from the mesh Mj- 1 , and 0 is 
the orthogonal complement operator. The principle of wavelet 
transform is to decompose the energy of a signal using two 
basic functions (prediction and update). Thus, applying these 
functions on a mesh in an analysis step, we obtain a lower 
resolution mesh and a set of wavelet coefficients needed to 
reconstruct the original mesh in the synthesis step. All these 
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v^v 

O* ^ D* , D’ 

M U M M 

(a) : Analysis phase 

Xf° M % M% L>* 

(b) : Synthesis phase 

Fig. 2. Wavelet Transform Decomposition. 


coefficients are assembled into a single vector called wavelet 
coefficient vector (WCV) as shown in equation 2. Especially, 
this vector will be modified during insertion. 


(Di\ 


WCV = 


W 


( df d\ d \ ^ 
\ df d\ d\ ) 


( 2 ) 


For doing, a lifting schema, which is a wavelet transform 
of second generation introduced in 1998 by Sweldens [23], 
should be used. It consists in exploiting the spatial and 
frequency correlation present in the mesh to reduce its entropy. 
As shown in figure 3, the lifting scheme is divided into three 
steps: 

• Poly-phase transformed: it is a basic operation that di- 
vides the signal to two sub-bands. 

• Prediction: This step exploits the spatial and frequency 
correlation present in the signals. During this step the 
elements of the first component are used to predict 
the elements of the second component. The difference 
between the predicted element and the element present 
in the same second component is the detail. 

• Update: at this stage the first component element rep- 
resent only a sub- sampling of the original signal. To 
transform these elements to low frequencies, we apply 
a low pass filter. 




Fig. 4. Butterfly filters: (A) shows the different types of points, (B) prediction 
filters. (C) the update filter . On each type of points will be applied filter 
corresponding along the axes of orientation. 


• The insertion of information, by modifying the wavelet 
coefficient’s vector, is done at different levels of resolu- 
tion. This eliminates significantly the interaction between 
inserted data [6]. 

• Transforming the mesh from its spatial representation can 
greatly increase invisibility and robustness seen that we 
does not manipulate directly topological and geometric 
information of vertices. 

• Wavelet transform is one of the methods that compress 
practically without losses. Their use allows increasing 
the robustness of our algorithm against the compression 
attack. 

D. Spherical coordinate system 

To insert information into a mesh, we should calculate 
the module of each wavelet coefficient. For this reason, a 
transition to the spherical coordinate system is necessary. 
To transform the Cartesian coordinates (x, y and z) of each 
wavelet coefficient into spherical coordinate system (p,0 and 
p), formula 3 is used. 



p = ^Jx 2 + y 2 + z 2 
0 = arccos(-) 
arccos 


p = 


2 x n — arccos 


\fx 2 +y 2 

x 

\/x 2 +y 2 


( 3 ) 


Fig. 3. two-channel lifting scheme: P and U represent operators of prediction 
and update. 


To recalculate the wavelet coefficients, the application of an 
inverse transformation is necessary after insertion. For doing, 
formula 4 is applied as follows: 


Especially, in our work we use an M-channel lifting scheme. 
M varies according to the number of mesh resolution levels. 
Our schema is based on the use of butterfly filters. These latter 
are composed of three prediction filters and an update filter. 
The prediction filter is the same but oriented differently. As it 
is presented in figure 4, the choice of prediction filter depends 
on the point on which the filter will be applied: for the red 
point, a red filter is applied and so on. The prediction of a 
point is a weighted sum of its neighborhood coefficients. The 
choice of working in multiresolution domain is due especially 
to these reasons: 


x = p x sin 0 x cos p 
y = p x sin 0 x sin p (4) 

z = p x cos 0 

E. BCH error correcting code 

The BCH code (code of Bose, Chaudhuri, and Hoc- 
quenghem) [24] is a block error correcting code intended for 
binary data. This code allows the correction of t errors with t 
between and |. Let n, the codeword length, be equal to 
2 m — 1. k refers to the number of bits to encode and m is the 
number of control bits. Abbas et al. affirmed in [25] that the 
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BCH code is able to correct a number of errors up to 25% of 
the number of bits transmitted. This encouraged us to involve 
this code in our watermarking algorithm. 

1 ) Galois Field ( GF): For a Prime number q, a Galois field, 
characterized through defined arithmetic operations, refers to 
the set of integers modulo q ensuring the following proprieties: 
Property 1: Let us consider the irreducible polynomial of 
degree m, named p(x), representing GF (g m ). p (x) does 
not contain roots in GF (q) but we assume that it has roots 
elsewhere. We note a one of these roots. Thus, the set of roots 
that we can handle is a 0 , a 1 , ..., a q -1 . 

Property 2: if a is a root in GF ( q m ) then: 

a l X a j = a d+j)rnod(2 m -l) ( 5 ) 


memory consumption is due to the decomposition of the mesh 
while being watermarked using the spiral scanning method 
which is the main idea of this manuscript. The minimization of 
memory consumption is due to the decomposition of the mesh 
while being watermarked using the spiral scanning method. 
As shown in Figure 5, our work is based on the use of 
a watermarking buffer. This refers to the maximum amount 
of memory reserved for watermarking. Decomposition of the 
mesh into GOTs depends on the watermarking buffer size. For 
each GOT overloaded in memory, watermarking which can 
be either an insertion or an extraction step is applied. Once 
treated, the next GOT is overloaded. This process ends when 
the entire mesh is treated. 


2) BCH encoder: To encode a message using the BCH 
code, we had to use a predefined polynomial said ’’generator 
polynomial”. The codeword is resulting from a multiplication, 
in Galois field that owns the code, of the generator polynomial 
with the message that we aim to code. 

3) BCH decoder: The decoding step using the BCH code 
follows these sex steps: 

Stepl: Let p(x) the polynomial that corresponds to the code- 
word. The syndrome Si = p(a 1 ),S 2 = p(<Y 2 ),52t = p{a 2t ) 
should be calculated. 

Step 2: If the syndrome is zero, the received message is 
correct. The execution of the algorithm stops. If not, the rest 
of the algorithm is executed. 

Step 3: The number of errors that occurred noted S can be 
calculated. It corresponds to the rank of the following matrix 
(see formula 6): 

(Si S 2 ••• St-1 St \ 

s 2 s 3 ■■■ St St + 1 


3D mesh 



End of Watermarking (scanning request) 


Fig. 5. The proposed watermarking schema based on spiral scanning method. 


A. Embedding 

Embedding involves inserting data in the host mesh while 
keeping its quality. This step requires then the presence of a 
multi-resolution mesh and a watermark in the form of a binary 
sequence. As already mentioned, embedding is applied to each 
GOT saved in memory. To further detail this step, we present 
figure 6 which shows a zooming of the watermarking part in 
figure 5. The steps necessary for watermarking a GOT are: 


\St S t+ 1 


s 2t - 1 

s 2t - 1 ) 


( 6 ) 


Step 4: Following the number of errors, system in equation 7 
is solved. 


(Si S 2 

s 2 s 3 


\St St+1 


St-1 

St 


St \ 

St + 1 

Sit- 1 


Sit- 1 J 


C(5+l\ 

C(5+2 


V C25 / 


5 

SI 


( 7 ) 

Step 5: The S roots a~ l1 , ...,a~ zS of the polynomial M (x) 
are determined. M(X) = ( 6 X S + ... + + 1. 

Step 6: The errors occurred in positions These bits 

should be reversed. 


' 



Fig. 6. Embedding Step. 


IV. Approach description 

We present, in this paper, a new blind watermarking ap- 
proach for multi-resolution 3D meshes. Our goal is to find 
a good compromise between invisibility, insertion rate and 
robustness while minimizing the amount of memory used 
during the execution of our algorithm. The minimization of 


1) Spiral Scanning acquisition: To treat an object during 
its acquisition, we must break it down into parts and send 
the necessary part to memory each time. Contrary to semi- 
regular mesh, decomposition of image into blocks or video into 
image looks very simple. The problem that arises during the 
decomposition of a mesh is the choice of the starting point and 
the direction chosen to browse the mesh. It is then essential 
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to answer the following two questions: from which triangle 
can we begin decomposition? And how do we ensure that 
we scanned the entire surface and that the entire mesh was 
processed? 

To answer these questions, we based our approach on spiral 
scanning method proposed to ensure a progressive compres- 
sion in [26]. The use of spiral scanning method during 
compression gives very good results in terms of minimizing 
the amount of used memory during compression. We will 
readopt this method for 3D watermarking. Our objective is 
to ensure that the watermarking process has swept the whole 
mesh and that there is no untreated part, which is tricky, as the 
topology of the object is varied. The choice of course affects 
directly the memory size used. 

Our idea is, then, to take a low frequency triangle as a unit 
and to provide a tool for scanning the mesh as shown in figure 
7. The step of calculating the neighborhood is very costly 





Fig. 7. Visualization of the watermarking process, using spiral scanning , of a 
3D mesh (Venus): (A) spiral acquisition sends a triangle with its neighborhood 
to initialize our watermarking algorithm. The next steps correspond to the 
evolution following the playback spiral. (B) reconstruction with all levels of 
resolution (details) 

in terms of the number of operations. To solve this complex 
problem, instead of searching the neighbors throughout the 
mesh, we proposed a procedure for locating the smallest area 
covering them using the following properties: 

Property 1: The neighbors of a central triangle Tc are among 
the sons of the father of Tc. 

Property 2: Of the sons of the father of a triangle T, the only 
one that is central is a neighbor. Other neighbors of T are 
among the neighbors’ son of the father of T. 

Property 3: If two triangles T1 and T2 are adjacent then one 
of them is central. Once this is known, it facilitates the search 
for other neighbors according to property 1. 

Property 4: Of the son of the father of a triangle T, the only 
one that is central is a neighbor. Other neighbors of T are 
among the neighbors’ son of the father of T. 

As shown in figure 8, the acquisition follows an oriented 
movement so that the scanning of the 3D mesh does not 
leave untreated portions. To do it, we follow, as a reference, 
the list LO = {a, c, b} for acquiring the triangles Ai, Bi and 
Ci, respectively colored in yellow (near the point a), green 
(vicinity of the point c) and blue (vicinity of the point b). 
This represents a complete initialization turn. When detecting 
a new triangle, we send it to the watermarking buffer and we 
update the new reference list. The second round will reference 
the newly created list being LI = {rO,rl, and so on. 

The end of transmitting triangles to the watermarking buffer 
corresponds to a new empty reference list. To explain in a 
clear and simple way, we propose algorithm 1. 

Algorithml : Spiral Scanning Method 

Vertex St = ”a” // Initialization 

// Ed is the third vertex triangle containing ”a” and ”b” and 




A set of GOTs 


Fig. 8. Spiral Scanning method. 


which is not in the StartingLlist. 

Vertex Ed = ThirdVertexOf(CurrentTriangle,a,b) 
StartingList=a,b,c // Definition of Lists LO (StartingList). 
EndingList=Ed // And LI (EndingList). 

while ( EndingListl = Null ) 

// The algorithm remains in the loop until the ’’EndingList” 
generated is empty. 

while ( WhileN ext (Starting List)\ = Null ) 

// Loop on the entire ’’starting” vertex to find the triangles 
of the entourage. 

Temp=ThirdVertexOf(CurrentTriangle,St,Ed) 

// Loop on all the ’’Ending” vertices. If the third vertexe 
’’temp” is the first element of the ’’EndingList” Then we 
move on to the next item in StartingList. 
while (t)emp != Lirst (EndingList) 

Add Temp in EndingList 
// Update of EndingList. 

Stat(T) = MarcTriangle (St, Ed, Temp) 

// Mark visited triangle T. 

Ed = temp 

Temp =ThirdVertexOf(CurrentTriangle,St,Ed) 

// Move to the next element in the StartingList. St = Next 
(StartingList) 

// The current StartingList is recorded in ’’List” that will 
contain the list of all the points covered spirally. 

List=List + StartingList; 

// switch the Starting List with the Ending move to the next 
round. 

StartingList = EndingList 
End. 

The application of this method leads to the division of the 
mesh into four parts (see figure 8): 

Part 1: treated and cleared (1). 

Part 2: treated and stored in memory (2). 

Part 3: during treatment (3). 

Part 4: not yet treated (4). 

2) Wavelet transform: The application of the spiral scan- 
ning method has decomposed the mesh into GOTs. Lor every 
GOT, we apply a wavelet transform to generate the vector of 
wavelet coefficients. This vector then undergoes amendment 
during embedding. Inserting data in the multi-resolutin domain 
leads to the increase of the amount of embedded information, 
better preservation of the mesh quality and the improvement 
of the robustness of our algorithm. 
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Fig. 9. Spiral scanning method: memory status. 


3) BCH encoder: To protect information against any type 
of alteration and to be able to correct the eventual errors that 
may occur [27], we used BCH error correcting code. This latter 
encode data before being inserted. As a result, a codeword 
containing the initial information and a set of control bits, is 
generated. This new data will be embedded into a host mesh. 

4) Embedding information: The vector generated from each 
GOT will be presented in the spherical coordinate system. 
Only the p component, which refers to the module of each 
wavelet coefficients, will be modified during embedding. This 
first component undergoes a modulation. It will be multiplied 
by a modulation coefficient determined experimentally to 
obtain a modulated coefficient that we call C' . Insertion occurs 
then according to the bit of data as seen in formula 8. 


f C = 0.3 if biti = 0 

\ C = 0.7 if biU = 1 


( 8 ) 


Once watermarked, we apply a demodulation to C’ and we 
present coefficients again in the Cartesian system. Finally, the 
watermarked GOT will be released from memory to upload 
the next one. Our approach relies then on the use of the 
spiral scanning method. At each time t, only a portion of the 
mesh is saved in memory to be watermarked. Once this part 
is watermarked and as long as there is untreated parts, the 
execution of our watermarking algorithm restarts. When the 
whole mesh is treated, an inverse wavelet transform occur to 
rebuild the watermarked mesh. 


B. Extraction 


on the results of modulation (see figure 10). After that, we 
move to the next coefficient. Once this part of the mesh is 
processed and the watermark is extracted, the following GOT 
is loaded into memory to be treated in its turn. Once the 
entire mesh is processed and the totality of the watermark 
is extracted, collected data will be decoded using a BCH 
decoder. This treatment allows the correction of wrong bits. As 


Watermarked GOT 


r-H 


Wavelet 

Transform 

Buffer 


WCV of 
the GOT 


End of computing the wavelet transform 
( sending request ) 


Wavelet coefficients 
acquisition 


Transition to 
spherical system 


^ p component 


4 ---, 



^Bit=0 


_Noj 


BCH Decoder 


Watermark Extracted 


Fig. 10. Extraction Step. 


shown in figure 10, only the watermarked mesh is used during 
extraction. Our watermarking algorithm is then said ’’blind.” 


V. Evaluation tools 

To evaluate this approach, we have to study firstly the 
influence of our algorithm on the mesh quality. This allows 
calculating the difference between the original and the water- 
marked mesh. As for tools, we resorted to calculating MSQE 
and PSNR. 

Secondly, we must focus on the extraction of information. 
Our goal is to recover correctly all the information from the 
watermarked and attacked mesh to reach the decision-making 
step. During this step, we have to compare the difference 
between the information inserted at the beginning and that 
extracted in order to conclude on the robustness of our 
algorithm. This amounts to calculating the correlation values 
between the inserted information and the extracted one. 


This step allows the extraction of the inserted information 
from the mesh. Unfortunately it is not always possible to 
extract all data correctly. This is due to treatments (also called 
attacks) applied to the watermarked mesh. The aim of our 
watermarking algorithm is to retrieve information correctly in 
spite of any kind of attack: the robustness criterion. 

We propose in this part, the extraction step of our approach. 
Our primary goal is to extract all the information correctly. 
To achieve this objective, we decompose the mesh into GOTs 
using spiral scanning method. For each GOT, a wavelet 
transform is applied to have a wavelet coefficient vector. For 
each coefficient presented in a spherical system, we apply 
modulation. Finally, we extract the inserted bit depending 


A. MSQE and PSNR 

Embedding a watermark in a host mesh should not affect 
the mesh quality. To evaluate this criterion, we should find 
differences between watermarked and original meshes by 
computing Mean Square Error (MSQE, MSE or MSDM) [28]. 
The MSQE is then calculated using formula 9. The main 
idea is to evaluate the distance between the two meshes. It 
represents the distance between a point x from the first mesh 
and a surface from the second one [28]. 

d(M, M) = ( l — -f d(x,M)dxf (9) 

K area(M) J xeM 
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The MSQE is then calculated using formula 10. 

MSQE = max{d{M , M), d(M, M)) 


TABLE III 

MSQE, PSNR AND CORRELATION VALUES ACCORDING TO WATERMARK 
(10) LENGTH. 


Another comparison tool we can also use is the PSNR (Peak 
Signal to Noise Ratio) measured in decibels (dB). This pa- 
rameter calculates the ratio between the signal dynamics and 
the error of the watermarking. 


PSNR = 20 x logio ( 


Bounding Box 
MSQE ’ 


(ID 


B. Correlation 

To evaluate the robustness criterion of our algorithm, we 
chose to calculate the correlation value between the inserted 
information II and that extracted 12. This allows measuring 
the intensity of the link between the two latter (see formula 
12 

c (Er = i^-^)x(Er=i^-/2) (l2) 

yJEuEi - N 2 x JEuEi - N 2 

n refers to the size of information. We can say that we 
were able to extract correctly all the information when the 
correlation value obtained is close to 1. 

VI. Approach experimentation 

For evaluating our watermarking algorithm, two criteria 
must be taken into consideration. The first one is visibility. 
It aims at measuring the impact of our algorithm on mesh 
quality. The second one is the robustness criterion. It focuses 
on the ability of our approach to extract data correctly from 
watermarked and attacked mesh. 

To test our approach, we used multi-resolution meshes stored 
in multireslution files having variable sizes. Table II summa- 
rizes the different meshes used during tests. 

TABLE II 

Data used during tests. 


Watermark length (bits) 

5 x 10 4 

15 x 10 4 

25 x 10 4 

MSQE 

0.7 x 10“ 6 

10 x -6 

1.2 x 10“ 6 

PSNR 

130 

124 

126.35 

Correlation 

1 

1 

1 


The p component of each wavelet coefficient, before being 
modified, is multiplied by a modulation coefficient determined 
experimentally. This coefficient is related to the strength of our 
algorithm and the visibility criterion. The higher the value of 
coefficient is, the more affected the mesh quality becomes. 
In order to study the influence of this coefficient on the 
watermarked mesh quality, we present table IV. Results show 
that, with a coefficient equal to 10000, we obtain an MSQE 
equal to 1.2 x 10 -6 and a PSNR value of about 126db. Working 

TABLE IV 

MSQE, PSNR AND CORRELATION VALUES ACCORDING TO MODULATION 
COEFFICIENT. 


Modulation coefficients 

MSQE 

PSNR 

Correlation 

10 

0.1 

45 

1 

100 

10“ 4 

60 

1 

1000 

0.2 x 10“ 5 

95.5 

1 

10000 

1.2 x 10“ 6 

126.35 

1 

100000 

0.5 x 10“ 6 

130.7 

1 


with a modulation coefficient equal to 10000, there is almost 
no difference between the original and the watermarked mesh. 
We will maintain this value for other applied tests. 

To locate our algorithm relatively to existing work, we com- 
pared our results with recently published results in terms 
of insertion rate, MSQE, PSNR and Correlation. Table V 
summarizes the results found during comparison. Although the 


Name 

Triangle numbers 

Vertex numbers 

Feline.dat 

516096 

258046 

Horse.dat 

225280 

112642 

Venus.dat 

81920 

40962 

Rabbit.dat 

70658 

35329 


A. Visibility criterion 

The objective of this section is to test the effect of our 
algorithm on the quality of watermarked mesh. To do this, we 
watermarked several 3D meshes and each time we calculated 
the MSQE and PSNR. We aim at finding a compromise 
between watermarking strength and invisibility criterion (see 
table III). Results presented in table III show that even with 
a large insertion rate (watermark=250000 bits), our algorithm 
does not affect the quality of the mesh. We can conclude, 
from an MSQE equal to 1.2 x 10 -6 and a PSNR value equal 
to 126.35, that our algorithm preserves the visual appearance 
despite the important size of inserted information. 


TABLE V 

Compromise between insertion rate, visibility and 

CORRELATION: COMPARISON WITH LITERATURE. 


Approach 

Insertion rate 

MSQE 

PSNR 

Correlation 

Our approach 

250000 

1,2 x 10“ 6 

126,35 

1 

[11] 

765 

0,004 

- 

1 

[3] 

- 

- 

68,78 

1 

[5] 

12732 

0 

84,47 

1 

[4] 

39707 

0,0 

84,13 

1 

[19] 

- 

- 

92,45 

1 

[28] 

1045515 

5 x 10“ 6 

- 

1 

[12] 

32 

- 

- 

1 

[13] 

21022 

2.7 x 10“ 5 

- 

1 

[7] 

10650 

0.2 x 10 -3 

- 

1 


number of bits that we can insert makes the insertion rate of 
other approaches insignificant (250000 bits in our works and 
about 39000 in others works), we still present best PSNR and 
MSDM values. Our approach is consequently able to insert 
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the highest number of bits in host mesh while keeping mesh 
quality. 

B. Attacks 

Evaluating the robustness criterion need the application of 
attacks after watermarking the host mesh. According to Kai 
Wang et al [29], attacks which threaten correct retrieval of 
information from a watermarked mesh can be classified as 
follows: 

• Geometry attacks: This category of attack tends to change 
coordinates of vertices without modifying the topologi- 
cal information (connectivity). Similarity transformation, 
Noise addition, smoothing and coordinate quantization 
are examples. 

• Connectivity attacks: This kind of attack modifies the 
connections between vertices without changing their po- 
sitions. Only the topological information is targeted. We 
cite simplification as example of connectivity attacks. 

1 ) Similarity transformation: This category, which is made 
up of three attacks: translation, rotation and uniform scaling, 
does not cause any alteration on the form of the mesh 
(see figure 11). Results, present in table VI, assert that our 


Q 



Rotation Uniform Scaling Translation 


Fig. 11. Similarity transformation Attacks. 

algorithm is robust against similarity transformation attacks. 
Indeed, the application of translation, rotation and uniform 
scaling to watermarked meshes, has not prevented the correct 
extraction of data. 


TABLE VI 

Robustness against similarity transformation attacks. 



Translation 

Rotation 

Uniform Scaling 

Correlation 

1 

1 

1 


2 ) Noise addition: The main idea of this attack is to modify 
the coordinates of vertices using a pseudo-random generator. 
This modification follows formula 13: 

x i = Xi + a x d 

Vi=yi + axd_ (13) 

z% = Zi + a x d 

With d is the distance from the center of gravity of the 
mesh and a a pseudo random number. In order to study 
the robustness of our algorithm against noise addition, we 



Fig. 12. Noise Addition Attack. 


carried out several tests. Each time, we apply this attack to 
a watermarked mesh, we extract data and we calculate the 
correlation value. Results, depending on the noise level and 
the proportion of vertices affected by noise are presented 
in tables VII and VIII. As shown in Table VII, correlation 

TABLE VII 

Correlation depending on noise level. 


Noise Level 

lO" 1 

10“ 2 

10“ 3 

10“ 4 

10“ 5 

10“ 6 

Obtained C 

0,06 

0,2 

0,5 

0,99 

1 

1 

C in [30] 

0.1 

— 

0.3 

— 

0.4 

— 

C in [8] 

0.02 

— 

0.6 

— 

0.8 

— 

C in [31] 

— 

— 

— 

0.05 

0.3 

0.55 


value is acceptable for a noise level down to 10 -4 . Correct 
retrieval of information is possible in this range of values. By 
comparing the correlation values of our approach to recently- 
published results, we note that our approach is an outstanding 
improvement over existing approaches. Changing the number 

table VIII 

ORRELATION DEPENDING ON PROPORTION OF VERTEX AFFECTED BY 

Noise. 


Mesh Modification level 

30% 

50% 

75% 

100% 

Obtained C 

0,972 

0,970 

0,956 

0,941 

C in [8] 

0,88 

0,86 

0,83 

0,71 

C in [3] 

0,482 

0,418 

0,302 

0,101 


of vertices affected by noise has not led to a large drop in the 
value of correlation. In fact, we kept values close to 1 . Results 
in [8] and [3] show a significant decline in correlation values. 

3) Smoothing : To apply a smoothing attack to a mesh, 
formula 14 should be used: 

x i = Xi + dF actor x d x 

Vi = Vi + dF actor x d y (14) 

Zi = Zi + dF actor x d z 


DFactor is a manually initialized parameter. d x , d y and d z 
should be calculated as shown in formula 15. 


dy 

d z 


E vertexN umber v — wertex Number 

> Xo—Xi 

im 1 Z — j j = 1 J 

V ertexNumber 

E vertexN umber \ EvertexN umber 

i = l / — 1 ^ ^ i 


E ver 
im 1 


vertexNum 


V ertexNumber 

mber v - EvertexN u 

2-g= i 


V ertexNumber 


(15) 


To study the effect of smoothing during extraction we did 
many tests. Found results are presented in table IX. For 
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TABLE X 

Correlation depending on quantization level. 


Quantization Level 

10 

12 

13 

14 

15 

20 

C in [31] 

0.3 

0.4 

0.45 

0.6 

- 

- 

Obtained C 

0,14 

0,628 

0,954 

1 

1 

1 


Fig. 13. Smoothing Attack. 

a dFactor value less than 10 -7 , we obtained a correlation 
value near 1. Comparing these results with those of previous 
published work published [8], we note that with our present 
approach we have reinforced the robustness against smoothing 
attack. 


table ix 

Correlation depending on Proportion of deformation 

(SMOOTHING LEVEL). 


dFactor 

10- 5 

10“ 6 

lO" 7 

10“ 8 

10“ 9 

10-to 

C in [8] 

- 

- 

- 

0.18 

0.31 

0.43 

Obtained C 

0,034 

0,2 

0,848 

0,968 

1 

1 


5) Simplification : The main idea is to present the mesh 
with a number of triangles less than the original representation. 
Removing triangles from the mesh can alter the inserted mark 
and even destroy it. The main idea of this attack is to remove 



4) Coordinate quantization: This attack aims at quantifying 
vertex coordination using two previously calculated factors 
according to the maximum and minimum values along x, y 
and Z Called X m i n , Umax ? Umim % max 3IKl . Q1 

refers to the quantization level which is initialized manually 
(see equations 16 and 17). 

Q-h/Dn %ma x X min 

oiep x — Ql 

Stepy = ( 16 ) 

Step, = 


Fig. 15. Simplification Attack. 

triangles from the mesh. Deleted triangles may be carrying 
the information which threatens the extraction phase. Applied 
tests, whose results are presented in table XI, show very well 
that we were able to extract all the information inserted despite 
the application of simplification in various levels. 

TABLE XI 

Correlation depending on simplification degree. 


Factor x 


X Xmin 

Step x 


X Step x %min 


Factory = 


y y-min 
Stepy 


X Stepy l/min 


Factor z 


Z Zmin 

Step z 


X Stcp z Zmi n 


(17) 


Previous factors are then used to quantify vertex coordinates. 
The quantization Follows formula 18. 


f F actor x if F actor x > 0, 5 x Step x 
( Factor x + Step x Otherxise 


(18) 


Coordinate quantization attack is also taken into account 



Original Mesh 



Fig. 14. Coordinate Quantization Attack. 

during the evaluation of our approach. Table X shows that 
we have good extraction results with quantization level up to 
13. This result became perfect (correlation =1) for a level more 
than 14. 


Iteration Number 

1 

2 

3 

4 

5 

6 

C in [30] 

- 

- 

- 

0.46 

0.31 

0.15 

C in [13] 

- 

- 

- 

0.79 

0.68 

0.61 

C in [7] 

- 

- 

- 

0.99 

0.97 

0.92 

C in [31] 

- 

0.6 

0.45 

0.25 

0.1 

0.05 

Obtained C 

1 

1 

1 

1 

1 

1 


6) Compression : The compression method shown in Figure 
16 includes two parts: a wavelet transform and a coding [32]. 
The input is a watermarked 3D mesh. 

This compression method processes the host mesh sequen- 
tially. Each time, the spiral scanning sends a GOT of coarse tri- 
angles with all its details to the memory of wavelet transform. 
The block of the wavelet transform calculates the wavelet 
coefficients and sends the result to be encoded. Coding Buffer 
has four steps: binary allocation, quantization, entropic coding 
and coding of connectivity. 

Obviously, compression should not alter the information al- 
ready inserted. Unfortunately, this type of attack presents a 
challenge for watermarking algorithms targeting 3D meshes. 
This justified the absence of experiments focusing on com- 
pression in the recently published algorithms. In this paper, we 
aim to concentrate on this type of attack. In order to conclude 
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3D meshes J 


Watermarking Buffer 




I 



Fig. 16. Compression Attack. 


TABLE XII 

Correlation depending on rate of compression. 


Bit/vertex 

0.1 

0.5 

1 

1.5 

2 

2.5 

3 

Obtained C 

0.07 

0.34 

0.4 

0.6 

0.89 

0.9 

1 


on the robustness of our algorithm against the compression 
attack, we applied several tests. We changed each time the 
compression rate. The result presented in table XII shows 
that for a rate greater than 2, we obtained good correlation 
values. All of the inserted information was correctly extracted 
when the compression rate is equal or higher than 3. Our 
algorithm is then robust against this type of attack even with 
low compression rate. This presents an improvement in the 
field of 3D watermarking. 

C. Memory consumption 

The approach proposed in this paper is based on the use of 
spiral scanning method. It consists in decomposing the host 
mesh into GOTs. Each time, a GOT is sent to memory to be 
watermarked. Once treated, it will be deleted from memory to 
allow loading and watermarking the next GOT. Evidently, this 
treatment reduces remarkably the rate of used memory and 
allows us to work even with a very small memory space as 
long as the application of wavelet transform is possible with 
this amount. 

Experimental results presented in Figure 17 show the effec- 



Fig. 17. Use of memory with spiral scanning method. 

tiveness of our algorithm in terms of memory consumption 
using multiple 3D objects. Indeed, reducing memory space has 
reached a value equal to 24% when watermarking the Feline 


object. This range of memory presents the minimum space 
required for the application of wavelet transform. Although 
these results are very motivating, improve even more the 
memory usage (in order to implement the algorithm or handle 
huge meshes) returns to the improvement of the wavelet 
transform used. 


VII. Conclusion 

In this paper, we propose a new watermarking approach for 
3D meshes. The main idea is to apply spiral scanning method 
to split the mesh (decomposition into GOTs) . At each time 
t, a GOT will be sent to memory to be treated. Treatment 
includes the application of a wavelet transform in order to 
generate the vector of wavelet coefficients. Components of 
this vector undergo modulation to be watermarked according 
to binary data. Finally, a demodulation phase and an inverse 
wavelet transform will be performed. Once this treatment 
is finished, the next GOT will be sent into memory. This 
process is stopped when the entire grid is watermarked. During 
extraction, the same sequence of steps will be executed using 
the watermarked mesh. Our algorithm is then said to be blind. 
The particularity of this work is the application of spiral 
scanning. This method of decomposition allowed a huge gain 
in memory adopted (reducing memory reached 24 % ). Indeed, 
we are able to control the memory amount used during the 
execution of our algorithm. Our algorithm can then work even 
with a very small memory space. 

Applied tests prove that our algorithm preserves mesh quality. 
It does not cause quality degradation of the mesh despite the 
large number of bits to be inserted (250000 bits). Previous 
displayed results, which present a considerable improvement 
compared to the results of recent works, assert that our 
algorithm is robust against several attacks such as similarity 
transformation, random noise addition, coordinate quantiza- 
tion, smoothing, simplification and compression. 

Concerning our future work, we think of changing the method 
of wavelet transform used to further reduce the amount of 
memory used. As for the criteria of robustness, we strongly 
believe in changing techniques used to improve the results 
already presented. 
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Abstract — Internet of Things (IoT) is an emerging technology 
which is covering everyday things from industrial machinery to 
consumer goods in order to exchange information and complete 
tasks while involved in other work. IoT based smart home 
automation system is a system that uses PCs, mobile phones or 
remote devices to control basic operations for home automatically 
from anyplace around the world using internet. The proposed 
intelligent home automation system differs from existing systems 
as it allows the user to operate the system from anywhere around 
the world by using internet connection along with intelligent nodes 
that can take decisions according to the environmental conditions. 
We implemented a home automation system using sensor nodes 
that are directly connected to Arduino microcontrollers. 
Microcontroller is programmed so that it can perform some basic 
operations on the basis of sensors data. e.g. fan is controlled on 
basis of temperature value and light is controlled on the basis of 
occurrence of motion in the room etc. Furthermore Arduino board 
is connected to the internet using Wi-Fi module. An extra feature 
this system provides is to monitor power consumption of different 
home appliances. The designed system provides the user remote 
control of numerous appliances locally as well as outside the home. 
This designed system is expandable, allowing multiple devices to 
be controlled. The objective of the proposed system is to provide a 
low cost and efficient solution for home automation system by 
using IoT. Results show that the proposed system is able to handle 
all controlling and monitoring of home. 


Keywords — Internet of Things (IoT), Wireless Sensor Network, 
Home Automation System, Energy Monitoring. 


microcontrollers etc. Figure 1 demonstrates that IoT 
applications are classified into three major categories as: 

• Society 

• Environment 

• Industry 

On the basis of classification the term “Things” can be 
distinguished in a different way and depends on the application 
domain in which it is used. 



Figure 1: IoT Applications 


I. Introduction 

Now a-days home and building automation systems are 
taking place of manual systems. They provide increased 
comfort when employed in private homes [1]. On the other 
hand they contribute an overall cost reduction and energy 
saving, which is todays major issue. A typical home 
automation system allows to control house hold appliances 
from a centralized control unit. These appliances include fan, 
lights, air conditioners, etc. For mostly commercial home 
automation systems, all devices should be compatible with 
centralized control unit [2]. 

The capabilities offered by the IoT make it possible to 
develop various IoT based applications. All the applications 
are build using many more smart things like sensors, actuators, 


In industry, IoT activities involve financial or commercial 
transactions between companies, manufacturing process of 
industries, service sector, banking security, and intermediaries 
etc. As a whole the thing may be product itself, the equipment 
used or transportation means used, or we can say everything 
that participates in product lifecycle. In environment, 
applications are based on activities like security, protection, 
monitoring and development of all natural resources such as 
energy management, agriculture, human body monitoring etc. 
Lastly in the whole society the thing may relates to devices used 
with in public places or devices used for assisted living. For 
example smart homes, smart cities and smart campuses etc. 

The proposed system can be easily integrated into the 
home’s electrical system and allows one to wirelessly control 
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appliances that are plugged into specially designed wall smart 
sockets. The designed system consists of Arduino Nano board, 
Arduino Mega 2560, NRF module, CC3000 Wi-Fi module, 
relays, keypad and sensors. Sensors are directly connected 
with Arduino microcontroller board in order to perform 
automated functions on the basis of data provided by sensors. 
Firstly data is processed by sensors and then forwarded to 
Arduino mega for further processing. Wi-Fi module is attached 
to Arduino mega to connect the whole system with internet. 
Sensors used in our system are DHT11 temperature and 
humidity sensor, PIR motion sensor, Current sensor, 
electromagnetic door sensor and MQ5 gas sensor. 

The system is capable of detecting when the user enters or 
leaves the room. Electromechanical relays are used to control 
light, fan and sockets on the basis of sensor’s data or 
commands received. Number of relays depends on number of 
appliances to be controlled. A remote is made using NRF 
module with Arduino Nano. The remote is used to control the 
smart socket locally. Current sensor is used to monitor the 
power consumed by home appliances to whom it is attached. 
The power consumption can be monitored using internet and 
is shown on LCD attached to the system as well. 
Electromagnetic sensor is used to monitor the door status, 
either the door is open or closed. 

The main challenges faced by smart home systems are high 
ownership costs, poor security, poor management, and 
inflexibility. Some systems provide complete solutions but 
those are very costly. These complex systems usually need to 
be integrated when the building is constructed and must be 
planned in advance. They are also difficult to upgrade or 
replace once installed. The overall investment adds up 
considerably and is financially infeasible in most cases. These 
drawbacks hinder the popularity of such systems. Secondly, 
world is in the grip of serious power crisis since many years. 
A reason behind these severe crisis is the wastage of energy. 
Normally it is seen that the electricity load shedding increase 
in busy hours. Therefore, our designed system is able to 
overcome all the above mentioned problems efficiently and 
cost effectively. 

The remaining part of this paper is organized as follows; 
related works is discussed in section II, section III outlines the 
internet of things. Prototype of proposed system is presented 
in section IV. Section V describes the components of IoT 
based home system. The results and its analysis is discussed in 
section VI, whereas last section concludes this paper. 

II. Related Works 

This section presents the existing techniques of smart home 
system based on IoT, ZigBee, Bluetooth and smart phones. 

In [3], smart homes are described to introducing the 
technology in your home environment for comfort, security, 
convenience and to provide energy efficiency to its occupants. 
In [4], author describes that with advancement in sensor 
networks there comes a rapid increase in automation. 
Everything going to be automated rather than manual. Due to 
rapid increase in internet users a technology named Internet of 
things (IOT) emerged. IOT is an emergent network of daily 


objects. Smart home system using IOT is monitoring and 
controlling basic home functions automatically or through 
internet using mobile or PC. Intel Galileo board is capable of 
integrating wireless communication, cloud networking and to 
remotely control various devices and appliances. System is 
flexible depending on the type of sensors. 

In [5], the author states that by making your home 
environment intelligent enough we can make life easy for 
disabled and elderly personals. In last few years there is much 
increase in home automation because of rapid increase in smart 
phone usage. Introduction of Internet of Things boosted up the 
research and implementation of smart homes. In [6], the author 
suggests his idea that In order to make smart home low cost 
and flexible micro web server based on Arduino Ethernet, 
hardware interface modules and android based application is 
used. Using this system authorized users can remotely control 
and monitor home devices connected through 3G/4G or Wi-Fi. 

In [7], the main objective of the author is to provide Home 
automation system by integration of smart phones, cloud 
computing, wireless and power line communication (PLC). The 
system will be capable of providing the remote access to switch 
on and switch off various home appliances with in home. This 
system facilitate the user by consolidating hand held wireless 
remote, PC based application and android application. In [8], 
the author explains a system which is based on standalone 
embedded system Arduino Mega Android Accessory 
Development Kit. ADK acts as intermediary between home 
appliances and android mobile or tablet as ADK put forwards 
the coming signal from the mobile/tablet to the devices. The 
author presents the design and implementation of the system 
that is capable of monitoring and controlling the home devices. 

In [9] the author suggests to design and implement a flexible 
and secure cell phone based home automation system. The 
design is based on a standalone Arduino BT board and the home 
appliances are connected to the input/ output ports of this board 
via relays. The communication between the cell phone and the 
Arduino BT board is wireless. This system is intended to allow 
a number of devices to be controlled with minimum changes to 
its core. Only authorized users are allowed to access the system 
as system is password protected. 

In [10] the author suggests an innovative, detached and 
flexible ZigBee based smart home system. The system is 
flexible and scalable that allows extra home appliances 
designed by multiple vendors to securely add to the home 
network with the minimum amount of extra work. The system 
allows its owners to monitor and control the connected devices 
locally, through multiple controls like any Wi-Fi enabled device 
which supports Java or using ZigBee based remote control. 
Moreover, in this system a common home gateway is used to 
integrate ZigBee based home automation system and Wi-Fi 
network. The network is interoperable, simple and flexible due 
to common home gateway that provides user interface, and 
remote access to the system. 

[11] Come up with an idea to control the home using 
Bluetooth technology. System provides convenient access and 
monitoring to the home appliances with in short range. 
AT89C51 single chip microcontroller is used to control the 
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designed circuit. Different functions are performed based on 
programming of the circuit. The system is suitable only for 
ordinary household applications. Only short range control of 
different devices is provided in this system. In [12] the author 
proposed a Bluetooth based energy management system in 
which devices communicate with energy management system. 
The devices communicate using Bluetooth low energy 
technology. The proposed approach elaborates that devices 
automatically goes to stand-by state in peak hours while small 
devices remain in working condition. This saves energy as well 
as reduces the electricity bills. 

In [13] the author introduced the emerging technology of 
IoT for the purpose of environmental monitoring in smart 
homes. The system is designed by integrating wireless sensor 
network and internet of things. Sensor nodes are responsible for 
sensing the provided environment and send the data to central 
node. Central node is connected with Wi-Fi router acting as a 
gateway in this system. The user monitors the data through 
webpage. In [14] the author proposed a smart home system 
based on Wi-Fi technology and IoT technology. In this system 
a low cost Wi-Fi module is used with different sensors and 
home appliances in order to monitor and control them. Power 
line communication is used to control the devices. System can 
be locally accessed through mobile phone or tablets. For remote 
access of the system, the system comes up with home proxy and 
remote server. They are communicating with the smart device 
using XMPP protocol. Remote server is capable of controlling 
and communicating different smart homes. 

In [15] the author designed a system in order to minimize 
transmission delay and to handle large data. A new protocol is 
used to assist living in smart homes. The protocol used is named 
as wellness sensor networks. This protocol was used in early 
smart homes in nineties. A local database server is used for 
building statistics of data and data is sent to Webserver for 
remote access. In [16] the author proposed an IEEE 802.15.4 
based smart home and energy monitoring system by using 
CC2430 on chip technology. Main focus of this paper was to 
design a hardware which is capable of doing automatic load 
balancing and load prioritization results in bill saving. In [17] 
the author proposed to design a system based on ZigBee based 
wireless sensor network. The system can be remotely accessed 
via home gateway designed based on a LM3S9B96 chip and a 
RF CC2520 chip. The gateway software uses real time 
operating system free RTOS embedding both TCP/IP and 
ZigBee protocol stack on it. 

III. Internet of Things 

In this section, we briefly discuss the IoT which is an 
emerging network of daily objects from industrial machinery to 
consumer things that can exchange information and complete 
tasks when you are busy in other activities. A basic example of 
such objects is smart home automation which uses mobile 
devices or computers to control home devices using internet 
from anyplace around the world. An automated home is 
sometimes called a smart home. It is intended to save the human 
energy and electric energy. The proposed home automation 
system differs from other existing systems as it allows its users 
to control the system from anywhere across the world using 


internet connection along with intelligent nodes which can take 
decisions according to the environmental situation. There are 
other domains e.g. healthcare, industrial automation, 
transportation, and natural and other disasters where IoT can 
play incredible role and can help us to improve quality of our 
lives. 

IoT elements help us to better understand the real 
importance and functionalities of the IoT. In IoT Identification 
of each object helps to identify the objects uniquely. Objects 
may use public IP addresses instead of private for identification. 
For providing a clear identity to objects within the network 
different identification methods are used e.g. IPv4 and IPv6. 
Sensing includes collection of information from different 
objects in the network and to send that information on the cloud, 
local server, or database. Then the gathered data is analysed in 
order to take particular actions based on services and data. In 
IoT data can be gathered from smart sensors, wearable sensing 
devices and actuators. In IoT heterogeneous objects are 
connected with each other to provide specific smart services. 
IoT objects communicate on lossy and noisy links because of 
the low power operation. Communication in IoT takes place 
using protocols IEEE 802.15.4, Bluetooth, WI-FI, Z-wave etc. 
In IoT data is processed using dispensation units e.g. 
microcontrollers and software applications e.g. cloud services. 
The processing units and software applications specifies the 
computational ability of the IoT. Hardware platforms used for 
IoT are i.e. Arduino, Raspberry PI, Intel Galileo, UDOO etc. 
The other important calculation part of the IoT is Cloud 
platforms which facilitates different objects in order to send 
their collected information to the cloud. The received data on 
cloud is then processed in real-time and helps the user to benefit 
from the knowledge extracted. IoT services are important for 
improvement of our life-style; these services include Identity 
related services, Information aggregation services, 
Collaborative aware services and Ubiquitous services. These 
services are improving our daily lives by providing Smart home 
systems, intelligent transportation systems, Industrial 
automation, Smart health care, Smart grid and Smart city. In 
IoT semantics refers to the ability of extracting knowledge from 
different objects for the provision of required services. 
Semantics recognize and analyse the information to take 
decision in order to provide exact services. 
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Figure 2. IoT elements 


In IoT sensors are capable to sense, think and perform 
actions by having them communicate together, to share records 
and information to make decisions. The general IoT elements 
are shown in figure 2. The general idea behind IoT is that each 
domain precise software is interrelating with domain unbiased 
submissions, while in each area devises and actuators 
interconnects with each other without delay. IoT is projected to 
be used in smart homes as it allows its users to routinely open 
their garage when reaching at their gates, turn on the fan when 
temperature is getting high, notify the users if there is a gas 
leakage in the kitchen, and control their appliances when away 
from home using internet. 
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The architecture of IoT is consisted of three basic layers, 
first is Perception layer which signifies the physical objects 
such as sensors and actuators to perform different actions or to 
monitors objects such as motion, temperature, humidity, 
energy, etc. Second is Network layer which is responsible for 
the transfer of data produced by the sensors by means of Wi-Fi, 
Bluetooth, Infrared, GSM, ZigBee, etc. This layer allows the 
IoT applications to work with various types of objects with 
different kind of specified hardware platform. Other processes 
handled by this layer are data management process and cloud- 
computing. Network layer is mainly used to collect data from 
perception layer and processing the data for application layer. 
Third is the Application layer which directly interacts with the 
user to which it offers the requested services. For example, this 
layer can provide gas and temperature measurements to the user 
who demands for such data. The last layer has significant 
importance in IoT because it provide high quality intelligent 
services to the users they need. This layer covers various 
marketplaces such as smart Healthcare, smart homes, industrial 
automation, smart grid, etc. 

IV. Methodology 

A. Prototype of Intelligent Home Systems 

The designed IoT based intelligent home system comprises 
of four major portions as follows, 

Automation: In this portion sensors are connected with the 
controller (Arduino Mega) and automates different objects as; 
the light will automatically turned on when someone enters the 


room, the alarm will notify you when there is a gas leakage in 
the kitchen and the alarm will also notify you when the main 
gate remained open for certain time. 

Energy monitoring: In this portion temperature and current 
sensors are connected to the controller (Arduino Mega) 
temperature sensor is used to automates the fan in the room as 
the fan will automatically turned on when the temperature rises 
to certain value and the fan speed will gradually increase with 
the increase in temperature. Current sensor is used to monitor 
the energy consumption of the appliances at home. The Wi-Fi 
module is used to send the data to the internet and is accessed 
at web page. The values of energy consumption and 
temperature are shown on web page and the control of the 
appliance is also connected on the web page which can be 
accessed globally. 

Smart Socket: In this portion we designed a remote in which 
keypad and wireless module (NRF) working as sender are 
connected with controller (Arduino Nano). On the other hand 
socket is comprised of controller (Arduino Nano) which is 
connected with wireless module (NRF) working as receiver and 
relay. Remote is used to control the socket locally. 

Smart Meter: In this portion smart meter is designed which 
comprises of current sensor and LCD connected with controller 
(Arduino Mega). Smart meter is designed to calculate the 
energy consumption and number of units consumed in a smart 
home. 

The detailed working of IoT based intelligent home system 
is shown in figure 3 . 
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Figure 3. Prototype of intelligent smart home system 
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B. The designed architecture is comprised of four folds: 

Automation part is built by using following components as 
following: Arduino mega is used for controlling the whole 
automation part of the project. It is connected with the other 
modules used in automation as shown in figure 4. It actuates 
different devices on the basis of sensors data. Sensors attached 
to Arduino Mega are following: Temperature and humidity 
sensor, Motion Sensor, Electromagnetic door sensor, Gas 
Sensor, Electromagnetic Relays are used to control and 
automate the electrical appliances on the basis of sensors data. 
We used 5 Volt to power supply for micro-controller. 



Figure 4. Automation flow 

Energy monitoring part is used to monitor and control 
the energy consumption of home appliances especially heavy 
appliances by using web page as shown in figure 5. This part 
consists of following modules: CC3000 Wi-Fi module is 
attached to Arduino mega; we used this to send data to the 
internet and to receive commands through web page. 



Figure 5. Energy monitoring flow 

This circuitry can be used to measure the power consumption 
of different home appliances. It measures voltage with an AC 
to AC step down transformer acting as power adapter and 
current by using a CTO 13 clamp sensor. This makes the system 
pretty safe as no high voltages are used in this circuit. This 
designed portion can calculate real power, apparent power, root 


mean square voltage and root mean square current. Arduino 
digital domain is used to make all calculations for this portion. 
LCD display is attached to Arduino mega, we used it for energy 
monitoring locally. Arduino mega is controlling the Wi-Fi 
module and other circuitry used in this part of the project. It gets 
the data from current sensors, calculates desired values and 
shows these values on web page and LCD as well. 

Figure 6 represents the flow of smart socket. Smart 
socket is designed to remotely control the appliances within the 
room by a specially designed remote. This part consists of 
following modules: Arduino Nano is connected with NRF 
module and electromagnetic relays in order to automate the 
home appliances like light, fan etc. It also controls the specially 
designed smart socket using remote. NRF module is used at 
both ends to wirelessly control socket using remote. Keypad is 
attached to the microcontroller on remote side in order to 
control the socket. 



Smart Socket Remote 

Figure 6. Smart socket flow 


Smart meter is used to monitor energy consumption of 
home, is shown in figure 7. This circuitry is used to measure 
electrical energy consumption in your home. It measures 
voltage with an AC to AC step down transformer acting as 
adapter and current with CTO 13 clamp sensor. This makes the 
system pretty safe as there is no interaction needed with high 
voltages. 



Figure 7. Smart meter flow 
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3. Smart Socket 

Smart socket is comprised of Arduino Nano microcontroller 
boards with electromagnetic relay and NRF transceivers. 
Remote is also build of same components, the only 
differentiation is that relay is replaced with keypad in order to 
send the commands to the socket. It provides easiness to elders 
in order to control any device with in the room as NRF 
communicates with in eight to ten meters. Figure 10 shows the 
smart socket. 


The energy meter can calculate real power, and the units 
consumed in KWH. Arduino digital domain is used for 
calculating values for this smart meter. LCD display is attached 
to Arduino mega, which displays power consumption at home 
and the unit’s consumed. It gets the data from current sensors, 
and calculates desired values. 

V. Components of IoT based Intelligent Home System 

This section briefly describes the components of IoT 
based home system. 

1. Automation 

The components used in automaton part of this system are 
PIR motion sensor, DHT 1 1 temperature and humidity sensor, 
MQ5 gas sensor and electromagnetic door sensor as shown in 
figure 8. On the basis of data coming from sensors different 
functions are performed like fan speed is controlled according 
to temperature values and light turns on when someone enters 
the room. Their statistics is also build. 


Figure 8. Home automation 

2. Energy Consumption Monitoring 

The components used in energy monitoring part are CTO 13 
current sensor, a step down transformer, LCD, cc3000 Wi-Fi 
module, capacitors and resistors are shown in figure 9. Current 
sensor clamped on the positive wire of the appliance whose 
power consumption we want to monitor. The values are shown 
locally on LCD and remotely on the web page. 


Figure 9. Energy consumption monitoring 


Figure 10. Smart socket 

4. Smart Meter 

Figure 1 1 shows the smart meter. Smart meter is designed with 
almost same components as used in energy monitoring part of 
the system. Only Wi-Fi module is not used in this part of the 
system. It is programmed so that it cannot only show us power 
consumption at home but also calculates the units consumed. 


Figure 1 1 . Smart Meter 
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VI. Performance Evaluation 

This section represents the performance evaluation of our 
intelligent home system based on IoT. The results are only 
shown for energy consumption, voltage monitoring, and 
temperature monitoring. 

The designed system is tested by installing it in home 
environment. Interconnecting with the home router using IPv4 
such that router worked as IoT application gateway for the 
designed system. Integrated system is continuously used and 
generated real time graphical representation of the sensed data. 
The rest of this section presents the results. 

Figure 12 shows the energy consumption of a 100W bulb 
from 9am to 1 1 am. Power is calculated by using ampere and 
voltage as under: 

Power = Voltage * Current (amperes) 

Both these values are graphically represented on IoT webpage 
working on the embedded static IP of the Wi-Fi module. If some 
appliances is consuming more power and exceeding the 
threshold (a maximum limit), the user can control the appliance 
through IoT webpage. 

The real time temperature observed on the webpage against 
time is shown in figure 13. Thus, on the basis of temperature 
data the fan speed is automatically controlled. On the other, 
hand the user can control the socket from the IoT webpage to 
turn the air conditioner ON or OFF after getting temperature 
values. 

Figure 14 shows the input voltage fluctuation for a 100 watt 
electric bulb from 9am to 9pm. It is observed that in busy hours 
the voltage drops to the minimum limit. As voltage vary, the 
power consumption of the appliances also vary. 

Figure 15 shows the power consumption of 1.5 Ton air 
conditioner from 9am to 1 1 am. The power consumption varies 
as voltages fluctuates, this is because the power is calculated as 
a product of voltages and amperes it consumes. Results are 
verified by taking these values using multi meter. 
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Figure 13. Temperature observed with respect to time 
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Figure 14. Voltage variations with respect to time 
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VII. Conclusion 

The designed system overcomes almost all the existing 
highlighted problems. This system not only monitors the 
sensors data but also actuates a process according to the 
requirement. This system highlights almost all important 
scenarios of smart home systems like: smart security, alarming, 
smart metering and energy monitoring. This efficient and cost 
effective model can be implemented in a real environment. In 
future, this work can be extended to monitor and control the 
home on a cloud. 
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Abstract — In this paper, we present our strategy adopted to 
deal with the mobility into publish/subscribe. Specifically, we 
focus on the management of the mobile users from one broker 
to another. In fact, the topic of mobility into publish/subscribe 
systems may cause many problems such as the increasing of the 
traffic into the network and the messages loss. To overcome these 
problems, we have created a selective scheme on the basis of an 
accurate selection. In fact, a threshold value is devoted to be 
the criterion for the selection of caching points. On the basis of 
this principe, we apply various network settings to explore the 
effectiveness of our approach. Hence, we extract the improvement 
of our approach on the messages loss, the caching cost and 
the propagation cost in function of buffer size, publication rate, 
period of disconnection and connect time. 

Keywords -Distributed Networks; Mobile Computing; Publish/- 
Subscribe; Prediction Management; Performance Efficiency. 

I. Introduction 

Nowadays the propagation of the pervasive computing de- 
vices, with the emergence of network access technologies 
(mobile wireless, wireline, and Internet), has led all kinds of 
devices to access networks. So, all these facts has given rise to 
the mobile computing paradigm. In this paradigm the users can 
be assumed stationary while on-line, but change the physical 
access points to the network. So, the users may temporarily 
disconnect from the network. Then, upon their connection, 
they expect to recuperate the data disseminated while their 
disconnection occurs. This demands a flexible middleware 
infrastructure, based on a scalable interaction style, to cope 
with the dynamic nature of mobile computing. In this context, 
the publish/ subscribe model can be very promising. 

The publish/subscribe paradigm shown in Fig. 1 consists of 
a set of distributed nodes elaborating the communication into 
the network. Two types of clients are existing based on their 
roles which are subscribers and publishers. The subscribers are 
information consumers. The publishers are information pro- 
ducers. The messages are passed from publishers to interested 
subscribers through the brokers. The route from publishers to 
all interested subscribers is coordinated by brokers for assuring 
the matching. 

The potential of the publish/subscribe communication 
model consists of the full decoupling of the interacted parties 


in time, space, and flow [1], [2], [3]. This decoupling makes 
the publish/subscribe systems flexible and scalable. Also, the 
brokers remove all explicit dependencies between publishers 
and subscribers. In fact, the multicasting mechanism imple- 
mented by brokers decouples publishers from consumers. 
This makes the publish/subscribe system a good candidate 
for mobile computing by inducing three important effects. 
The first effect is that a client can operate in the system 
without being aware of the existence of other clients. So, the 
client only know the structure of the event notifications for 
issuing its interest in the form of subscriptions. In practice, 
the publish/ subscribe approach could be easily exploited by 
a PDA to advertise its presence in a room and receive the 
services published there. The second effect is the ability of 
the client to connect and disconnect without affecting the 
other components. The third effect is the suitability of the 
publish/subscribe communication to cope with unannounced 
disconnection of clients, which characterizes mobile networks. 

Given the strength of this paradigm, a large number of pub- 
lish/subscribe middleware have been developed. These systems 
differ along several dimensions. Two main dimensions are 
usually considered fundamental which are the expressiveness 
of the subscription language and the architecture of the event 
dispatcher. 

The expressiveness of the subscription language classifies 
the publish/subscribe systems into three categories which are 
topic, type and content systems. The first two categories [4], 
[5], [6] are limited in the expressiveness. In the content-based 
systems [7], [8], [9], subscriptions contain expressions that 
permit sophisticated matching on the event content. In our 
work, we have applied our approach into a publish/subscribe 
middleware providing a content-based subscription language. 

Two types of architecture are proposed for publish/subscribe 
systems. In the centralized architecture a single component act 
as event dispatcher. So, the publish/subscribe system could 
not be scalable enough. Also, the risk of a single point of 
failure may occur. In the distributed architecture [10], [11], 
[12], a set of interconnected brokers coordinate in collecting 
subscriptions coming from clients and in routing events. This 
architecture contributes to the reduction of network load and 
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the increase of scalability. The topology of the distributed 
brokers differs from system to system. Most existing publish/- 
subscribe systems are implemented for fixed environments. So, 
several extensions are needed to make these systems able to 
cope with mobile subscribers. 

In recent years, more importance is accorded to the per- 
formance issues induced by the mobility of the users [13]. 
To solve these issues we propose to predict the mobility of 
subscribers. This prediction is based on a dynamic selection 
of the most probable locations that the subscriber moves to 
during its run. Our approach aims at forwarding the required 
information for the mobile users at their new locations upon 
their connection while minimizing the caching cost, the prop- 
agation cost and the messages losses. 



Fig. 1. The publish/subscribe Model 

The rest of the paper is formulated in this way: We give 
an overview of the management of subscriber mobility into 
publish/subscribe systems in section II. Then, we present 
the principe of our approach and the strategy that we have 
followed in section III. After that, we explore the adequacy 
of our approach by applying various network settings and 
compare it to the standard proactive scheme in section IV. 
Finally, section V concludes the paper. 

II. Related Work 

The mobility is an issue examined in many fields [14], 
[15], [16], [17]. Precisely, in this section we give an overview 
of studies which propose approaches for the management of 
mobile subscribers into publish/subscribe systems. In fact, the 
mobility of the subscribers is managed by three types of 
approaches. The first type is the durable subscriptions [18], 
[19], the second type is the reactive approaches [20], [21], [22], 
and the third type is the proactive approaches [23], [24], [25]. 
Each type differs from the other by the time and the manner 
that the transmission of the cached publications is realized. 

A. Durable Subscription Approaches 

The approaches classified in the category of durable sub- 
scription are limited to the operation of disconnection and 
connection to the same broker. As an example of approaches 
adopting this type of approach we find Elvin [19]. The 


followed strategy is realized on the basis of proxies used for 
the buffering of publications. Indeed, these proxies play the 
role of the disconnected subscriber in order to transmit to it the 
desired interest upon its reconnection. Thereby, two roles are 
attributed to this proxy according to the state of the subscriber 
whether it is connected or not. Hence, when the subscriber is 
disconnected, the proxy is considered as a subscriber express- 
ing the interest of the disconnected subscriber. Consequently, 
when the subscriber reconnects, the proxy is transformed into 
a server transmitting to it its cached publications. 

The same category of strategy was adopted into JMS [18]. 
In fact, we remark that this approach causes a considerable 
loss of messages when the subscriber does not reconnect to 
the same broker. In addition, this induces a significant load 
on the network by the fact of storing infinitely the interest 
of the disconnected subscriber into the old broker. In fact, 
the operation of caching is stopped only when the subscriber 
reconnects to the old broker. Hence, a serious degradation is 
induced on the performance of the system. 

B. Reactive Approaches for Mobile Subscriber 

Many systems have deployed the reactive strategy to handle 
with mobile subscribers. Rebeca [30], [31] is one of those 
systems. In fact, its strategy is elaborated on the basis of virtual 
counterparts related to the old broker. The reconfiguration 
of subscriptions is accomplished as soon as the subscriber 
detects the change of the broker. In fact, the junction broker 
between the old and the new broker transmits the buffered 
messages. Hence, the junction broker is the only responsible 
for the handoff requests. So, a significant increase in the 
handoff latency is induced. In addition, the junction broker 
risks to be overloaded by the massively quantity of messages 
moving through it. Therefore, the performance of the system 
is affected. 

The same type of strategy [21] was developed into SIENA 
[26], [27]. Indeed, this strategy is handled by the proxies 
related to the brokers of the publish/subscribe system. Un- 
fortunately, the massive use of proxy components affects the 
performance of the system. Also, many duplicated messages 
are overloading the network and are not deleted due to the 
expensive operation of elimination. 

Reds [28] is another publish/subscribe system that imple- 
ments a reactive approach [22] . The basic idea of the adopted 
strategy was founded on the selection of each broker as a 
caching point for the publication as long as it has a subscriber 
in its subscription table interested on this publication. This 
approach suffers from the overload on the client. 

Another reactive approach [29] was presented into Jedi[20]. 
The adopted approach runs as follows: The retransmission 
of subscriptions is first of all elaborated by the new broker. 
This retransmission is required for tracing the new routes for 
the publications. So, the publications matching the interest 
of the mobile subscriber are cached into the new broker. 
Consequently, the buffering is stopped in the side of the old 
broker. A major inconvenient in this approach is its limited 
scalability. 


352 https://sites.google.com/site/ijcsis/ 

ISSN 1947-5500 




International Journal of Computer Science and Information Security (IJCSIS), 
Vol. 14, No. 6, June 2016 


Another reactive approach based on the event mediators was 
proposed in [32]. The event mediators play the role of the 
buffer for the publications of the mobile subscriber. Hence, 
the event mediator sends the stored publications to the mobile 
subscriber upon its reconnection. The implementation of this 
approach is not clear and it suffers also from the limited 
scalability. 

In the same category of approaches, we find the strategy 
proposed in [33]. As the previous approaches, the old bro- 
ker buffers the publications for the mobile subscriber. As a 
drawback, this strategy suffers from the invocation of a high 
handoff latency in the large network. 

C. Proactive Approaches for Mobile Subscriber 

The main objective of proactive approaches is the minimiza- 
tion of the transfer delay of the cached publications. Thereby, 
these approaches are most times used for streaming and real- 
time applications. As a result of this minimization of delay, 
an increase in the network load is induced. 

As an example of proactive approaches, we find the strategy 
used in [25]. In fact, this approach is based on the exploitation 
of the neighbor graph. This latter represents the list of the 
brokers that will may be visited by the mobile subscriber. 
Indeed, this graph is constructed upon the reconnection request 
and the context transfer request invoked after the movement 
of the mobile subscriber. Thereby, all the brokers in the graph 
receive the subscriptions of the mobile subscriber before its 
movement. As a consequence of applying this approach, the 
network will be overloaded. 

Another proactive approach was applied in [23], [24]. This 
strategy employs a layer of replicators between publish/- 
subscribe system and clients. The replicators serve for the 
positioning of virtual clients at the possible brokers that may 
be visited by the mobile subscriber. A drawback of this 
approach is that the same subscription can be expressed by 
different subscribers related to the same broker. So, this broker 
risks to cache similar publications. This is due to the fact that 
the caching is invoked per subscriber. The major inconvenient 
of this approach is the huge load on the network and the 
incapacity to cope with the failures or long time disconnection 
of subscribers. 

This section has demonstrated that the reviewed works have 
accomplished the support of mobility with distinct rates of 
success. In fact, the management of mobility in these works 
poses different technological problems. Hence, these strategies 
are not yet effective and efficient for the management of 
mobility with no loss of messages. Also, they are characterized 
by their limitation in the performance metric. The main goal 
of our approach is the management of mobility in a trans- 
parent manner by assuring high performance. As mentioned 
before, the architecture of the publish/subscribe system can 
be distributed. In such cases, the management of mobility 
may affect the performance of the system due to the high 
traffic. Hence, it is primordial to create a new strategy with a 
reduced traffic. Also, our strategy requires to be flexible and 
scalable. To achieve such goals, our approach is based on the 


analysis of the most probable brokers to be next visited by 
the mobile subscriber. Next, we will perform an evaluation of 
our approach to extract its gains comparatively to the standard 
proactive approach. 

III. Proposed Approach 

The attention for extending publish/subscribe systems to 
mobile applications was little. In fact, the most extensions were 
based on a reactive strategy. This type of strategy suffers from 
the increase of network traffic and high handoff latency. In 
our work, we tend to manage subscriber mobility efficiently. 
The main idea relies on the selective predictive caching of 
messages prior to the movement of the mobile subscriber by 
the use of an intelligent mechanism. 

Recently, the users in real scenarios of mobility are mov- 
ing according to repeated routines. Indeed, we can offer 
a multitude of repeated movements every day for example 
from home to office, from home to school, from home to 
market and vice versa. So, we can rely on the probability of 
movements between brokers to extract dynamic calculations 
of probabilities on run time movement in order to offer an 
efficient management of mobility into publish/subscribe. 

Our selective scheme follows a predictive strategy and has 
proved its capacity to handle mobility with effectiveness. This 
effectiveness is assured through a clever selection of a set of 
brokers serving for the caching of published messages during 
the disconnection of the mobile subscriber. This predictive 
strategy in managing mobility contributes to the improvement 
of system availability. In fact, the past and actual states of the 
movement of the client are the key information for a correct 
prediction. Thereby, we use the information extracted from 
the actual movement of the subscriber in order to anticipate 
the future movements. So, the different movements between 
brokers are analyzed for the future prediction. 

As we have said, our strategy is based on the probability 
of movements between brokers. Hence, two important values 
are calculated dynamically which are the handoff weights and 
the threshold value. The handoff weight from broker A to 
broker B is obtained by fractionating the total sum of handoffs 
from broker A to broker B by the total sum of handoffs 
from broker A to all the other brokers. Hence, the threshold 
weight of broker A represents the average of weights from this 
broker. We obtain the weight threshold by fractionating the 
total sum of weights by the number of caching points. Hence, 
the selected caching points will require to have a handoff 
weight equal or greater than the threshold value. Thereby, 
a significant gain in the network traffic is approved by our 
approach through the elimination of the useless caching points. 
The following formulas clarify the two values with W is the 
weight, X is the number of handoffs, and nb is the number 
of caching points. 


W AB 


X A b 


E 


Xan 


N in caching points A 


( 1 ) 
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The previous approaches do not consider any criterion 
for the selection of caching points. Thereby, a considerable 
traffic on the network is induced in large networks. Therefore, 
the criterion of weight becomes essential to eliminate the 
useless caching points. So, the selected caching points are 
characterized by a weight value greater or equal than the 
threshold value. Hence, the calculation of different weights 
in a dynamic manner is recommended to obtain the updated 
values. 

In order to maintain the values updated, we apply upon 
each movement a function named Update- Weight for the 
broker from which the movement is invoked. In fact, for 
each movement the new broker notifies the old broker about 
the reassociation. So, the construction of caching points is 
elaborated correspondingly to the movements of the subscriber 
between brokers. 

The storing operation in the caching points begins when the 
mobile subscriber disconnects from its broker. To avoid a loss 
of messages into the caching points, the old broker is charged 
to send to the set of caching points the buffered messages 
published until the operation of caching begins. Hence, when 
the new broker to which the mobile subscriber connects is 
among the caching points, it will send directly the messages 
to the mobile subscriber upon its reconnection. 

The efficiency of the system is ameliorated since the caching 
points with a low probability to be visited are eliminated. 
Indeed, the caching points are selected intelligently by com- 
paring accurately the weights values to the threshold value. 
Hence, the caching points subscribe in advance and store the 
published messages instead of the mobile subscriber during 
its disconnection. A great major in our approach is its adapt- 
ability to all subscription language and to all general overlays 
topologies. 

The construction of the caching points is elaborated on 
the basis of changes occurred into the network. According to 
the movements of clients between the brokers, the values of 
weights and threshold are updated. Thereby, the set of caching 
points is dynamically varied. Indeed, the update of values is 
invoked for each movement. Hence, new brokers are added 
to the set of caching points and others are deleted. Therefore, 
we obtain a selection of caching points presenting the most 
probable brokers to be visited. Table I exhibits the selective 
dynamic behavior of our approach. Thus, a considerable load 
on the network is avoided. Added to that, we have tend to 
select always the closest caching points from which the mobile 
client recuperate its messages upon its reconnection when the 
new broker is not among the caching points. 

The quality of our approach can be measured by the fact 
that the new broker visited by the mobile subscriber belongs 
to the set of caching points. So, this indicates the exactitude 
in predicting the movement of the mobile subscriber. Hence, 
the recuperation of messages will be occurred directly from 


this new broker. So, the exactitude value can be expressed by 
the following equation: 

^ number of handoffs to a caching point(to , t ) 

number of handof fs(to , t ) 

( 3 ) 

This metric explores the adequacy of our approach in 
succeeding the prediction of the next accessed brokers. Hence, 
when the value of E x(t 0 ,t) is close to 1 that means that the 
selected caching points are useful enough for the prediction. 

IV. Evaluation 

We achieved all our experiments on a prototype implemen- 
tation of a distributed publish/subscribe system (PADRES) [34] 
to which we extend our approach and the standard proactive 
scheme. For all the experiments the same mobility model 
is applied. The evaluation of our approach is elaborated 
around the propagation cost, the caching cost and messages 
losses. The results of these evaluations are elaborated according 
to the buffer size, the publication rate, and the period of 
disconnection. Through these different parameters, we extract 
the gain of our approach compared to the standard proactive 
scheme. 

A. Propagation cost 

The propagation cost is the cost induced by the propagation 
of the subscriptions emitted by the mobile subscriber(ci) on the 
set of caching points. This propagation is occurred during the 
disconnection of the mobile subscriber which can be defined 
by the following equations with nh caching points(Bi) is the 
number of the caching points of the old broker Bi from 
which the mobile subscriber (ci) disconnected, and nb sub is 
the number of subscriptions of the mobile subscriber emitted 
before its movement and not yet matched. 

Propa cost(ci) = nbsub(ci) * nb caching point s(Bi) (4) 

We have varied the frequency of movement from the high 
to the low as shown in Fig. 2. We have used 30 and 120 
seconds as average duration of connection. In the scenario of 
high frequency of movement, the mobile subscriber connects 
for a short period of time, then it moves to other brokers. 
This results in triggering the propagation process in each 
movement. The number of propagated subscriptions depends 
on the subscription rate applied during the connection time. 
So, when the connection time is low, the number of propagated 
subscriptions is proportionally low. But, the high frequency of 
movement induces the increase of the propagation process. In 
contrast, when we have a long connection time, we risk to 
have an increased number of propagated subscriptions and a 
reduced propagation process as the frequency of movements 
will be reduced. 
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TABLE I 

Selective Dynamic Construction of Caching points 
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Fig. 2. Propagation cost 



B. Caching cost 

The caching cost is induced by caching the publications 
during the disconnection of the mobile subscriber (ci). This 
cost is calculated by the following formula with nb each 
pub(ci) is the number of cached publications, Cach cost(ci) 
is the caching cost, nb caching points(Bi) is the number of 
brokers at which the caching is occurred for the broker Bi 
from which the mobile subscriber disconnects. 

The selection of the caching points by eliminating those 
with a weight value under the threshold value permits to 
decrease considerably the caching cost. We varied the period 
of disconnection from 60 to 240 seconds. The increase in the 
period of disconnection induces an increase in the caching 
cost. By reducing the period of disconnection and the publi- 
cation rate, the cachnig cost will be reduced. 

Cachcost(ci) = nbrec pub (ci)*nb caching point s(Bi) (5) 


Fig. 3 shows the impact of our approach in reducing the 
caching cost on the network. The mobile subscriber moves 
between the brokers while varying the period of disconnection 
for different values of publication rates. The obtained results 
highlights the scalability of our approach and its capability 
to reduce the caching cost even for increased period of 
disconnection and publication rates. 

C. Loss of messages 

The loss of messages is a very important metric to consider 
in the evaluation of the performance. In our experimentations, 
we have used two values of buffer size 200 and 500, and 
we have varied the publication rate for different period of 
disconnections. Fig. 4 shows that the loss of messages in- 
creases proportionally to the increase of the publication rate. 
In fact, as more the publication rate is important, as more 
the number of stored messages will be important. Thereby, 
when the buffer becomes full, the cached messages will be 
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Fig. 4. Losses of messages for buffer size=200 





Fig. 5. Losses of messages for buffer size=500 


lost. So, as our approach presents a selected set of caching 
points, the number of cached publications will be considerably 
reduced. Consequently, our approach reduces significantly the 
loss of messages. This observation is confirmed by Fig. 5 when 
the publication rate is increased. So, for publication rate=27 
pub/min the loss of messages is minimized by 1350 messages. 
Thus, our approach permits to minimize notably the loss of 
messages especially for higher publication rate. 

The performed tests examined how much our approach can 
reduce the loss of messages and the traffic of messages in a 
distributed environment under different values of publication 
rates and period of disconnection. The obtained results showed 
that a considerable reduce is assured by our approach for 
higher publication rate. Hence, these tests have allowed us 
to compare exhaustively the behavior of our approach and the 
standard proactive scheme under the same conditions. 

V. Conclusion 

In this paper, we evaluated our strategy for the management 
of mobile subscribers into publish/subscribe networks. Various 
network settings are used to explore the adequacy of our 
approach compared to the standard proactive scheme. The 
obtained results show how much our approach can reduce 
considerably the loss of messages, the caching cost and the 
propagation cost in function of buffer size, publications rate 
and period of disconnection. In fact, our approach implements 
an efficient service for mobile subscribers. The efficiency is 
realized through a dynamic prediction for the next location of 
the mobile subscribers. The information for the prediction is 


extracted dynamically from the past and actual states of the 
mobile subscribers. 
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Abstract — Intelligent Transportation Systems are defined as 
those systems utilizing synergistic technologies and systems 
engineering concepts to develop and improve transportation 
systems of all kinds. Vehicular Ad-hoc Network (VANETs) which 
is an application of Mobile Ad-hoc Networks (MANETs) play an 
important role in ITS and emerged to provide Vehicle to Vehicle, 
Vehicle to Roadside and Vehicle to Infrastructure 
communications, aiming to improve safety on roads, exchange 
data between vehicles and provide different services to the users. 
According to special characteristics of VANETs like bandwidth 
limitation, high mobility, signal fading and real-time data 
communications, QoS provisioning in these networks is a 
challenging task. 

In this paper, we introduce an architecture for vehicular 
networks and a protocol stack which aims to reduce the 
processing overhead, make routing easier and provide Quality of 
Service in vehicular networks. Finally, after designing protocols 
and headers of the mentioned protocol stack, we will simulate our 
proposed idea in a vehicular environment and after simulation 
process, we will compare the achieved results with another 
scenario in which regular TCP/IP protocols are used. 

Keywords -component ; VANETs ; ITS; QoS ; Protocol Stack 

I. Introduction 

A. Intelligent Transportation System 

The Intelligent Transportation System (ITS) is a system 
which is able to exchange different kinds of information of its 
moving objects. ITS converges remote sensing and 
communication technologies to improve safety of 

transportation and make journey more enjoyable. As the 
objects are moving, wireless communication technologies play 
an important role in this system. ITS integrates information, 
communications, computers and other technologies and 
applies them in the field of transportation to build an 
integrated system of people, roads and vehicles by utilizing 
advanced data communication technologies [1]. ITS also 
includes a broad variety of usage scenarios and user 
preferences and interests. 

B. Vehicular Ad-hoc Networks 

The typical ITS scenario is land traffic on roads and the 
most common examples of ITS applications are the exchange 


of traffic information to provide roadside assistance, warning in 
case of emergencies and traffic jam. These services deal with 
data as, e.g. road condition, traffic light status and position of 
the single vehicle [2]. 

There are four typical ways of transportation, on the land by 
car or train, in the air or water. The most common traffic 
coming into our mind in combination with intelligent 
transportation systems is traffic on land. Among the means of 
transportation, the most prominent are cars, at the present time 
cars and other private vehicles are used daily by many people. 
The biggest problem regarding the increased use of private 
transport is the increasing number of fatalities that occur due to 
accidents on the roads. In recent years traffic congestion and 
accidents, as well as environmental pollution caused by road 
traffic and fuel consumption have become important global 
issues [3]. 

Vehicular networks are proposed to provide information 
exchange via Vehicle to Vehicle (V2V), Vehicle to Roadside 
(V2R) and Vehicle to Infrastructure (V2I) communications. A 
Vehicular Ad-hoc Network or VANET is a technology that 
uses moving vehicles as nodes in a network to create a mobile 
network and it turns every participating vehicle into a wireless 
router or node [4]. VANET is also capable of enhancing 
driving safety by exchanging real-time transportation 
information and it should upon implementation, collect and 
distribute safety information to massively reduce the number of 
accidents by warning drivers about the danger, before they 
actually face it [5]. 

VANETs have their own characteristics when compared 
with other types of MANETs. Authors in [6] describe the 
unique characteristic of VANETs as follows: 

• Predictable mobility 

• Providing safe driving, improving passenger comfort and 
enhancing traffic efficiency. 

• No power constraints. 

• Variable network density 

• Rapid changes in network topology 
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• Large scale networks 

• High computational ability 

The key role that VANETs can play in the realization of 
ITS has attracted the attention of major car manufactures and 
they continue to incorporate more and more technological 
features into their vehicles [4]. It is reported that over 50% of 
interviewed consumers are highly interested in the idea of 
connected cars, 22% of whom are willing to pay $30-65 per 
month for value-added connectivity services while on the road 
[7]. However, there are lots of challenges in this field. Authors 
in [6] list the issues as follows: 


Therefore, Heterogeneous Vehicular Networks may well 
support the communication requirements of the ITS. It is clear 
that a car that takes part in such a network is equipped with a 
WLAN and cellular communication device [3]. 

The rest of this paper is organized as follows: In section II 
we present some proposed architectures for vehicular networks 
while in section III QoS concepts are described. In section IV 
we review previous works and in section V the proposed 
architecture and protocol stack are given before the conclusion 
in section VI. 


• Signal fading 

• Bandwidth Limitation 

• Connectivity 

• Small effective diameter 

• Security and privacy 

• Routing 

Because of the challenges, limitations and new 
requirements in VANETs, the idea of Heterogeneous Vehicular 
Networking has emerged recently. 


C. Heterogeneous Vehicular Networks 


Heterogeneous Vehicular Networks (HVN) integrates 
cellular networks with Ad-hoc networks which is a potential 
solution for meeting the communication requirements of the 
ITS. Although there are a plethora of reported studies on either 
DSRC or Cellular Networks, joint research of these two areas 
is still at its infancy. 

Emerging heterogeneous networks not only have the ability 
of providing wide-area coverage to all vehicles in large-scale 
networks, but also supports real-time safety messages 
distribution in local areas in order to reduce traffic accidents. 
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II. vanet Architecture 

This part describes the system architecture of VANETs. We 
first introduce the main components of VANETs architecture 
from a domain view. According to [27] and [28] we are able to 
achieve the VANETs system by entities which can be divided 
into three domains: the mobile domain, the infrastructure 
domain, and the generic domain [29]. Authors in [6] describe 
the main system components as follows: Application Unit 
(AU), On Board Unit (OBU) and Road Side Unit (RSU). 

An OBU is a wave device usually mounted on-board a 
vehicle used for exchanging information with RSUs or other 
OBUs. The OBU connects to the RSU or to other OBUs 
through a wireless link based on the IEEE 802.11 p radio 
frequency channel, and is responsible for the communication 
with other OBUs or with RSUs. 

The AU is the device equipped within the vehicle that uses 
application provided by the provider using the communication 
capabilities of the OBU. 

The RSU is a wave device usually fixed along the road side 
or in dedicated locations such as at junctions or near parking 
spaces. The RSU is equipped with one network device for a 
dedicated short range communication based on IEEE 802.11 p 
radio technology, and can also be equipped with other network 
devices so as to be used for the purpose of communication 
within the infrastructural network (Fig. 2-4). Typically the RSU 
hosts an application that provides services and the OBU is a 
peer device that uses the services provided. The application 
may reside in the RSU or in the OBU; the device that hosts the 
application is called the provider and the device using the 
application is described as the user. Each vehicle is equipped 
with an OBU and a set of sensors to collect and process the 
information, then send it on as a message to other vehicles or 
RSU through the wireless medium [6]. The main functions and 
procedures associated with RSU are: 

• Extending the communication range of the Ad-Hoc 
network by re-distributing the information to other 
OBUs and by sending the information to other RSUs in 
order to forward it to other OBUs. 

• Running safety applications 

• Providing internet connectivity to OBUs 


Figure 1. VANET Architecture [8] 
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Figure 2. RSU extends the range of the ad hoc network [6] 




Figure 4. RSU provides internet connectivity to the OBUs [6] 

However, this architecture could not support all 
requirements and applications, therefore to remedy the 
drawbacks of existing vehicular networks, new ITS network 
architecture is needed in order to support various services 
under dense vehicular environments. Authors in [3] describe 
the framework of Heterogeneous Vehicular Networks (HVN) 
as follows: 
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Figure 5. Illustration of the unified HetVNET framework [3] 
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As illustrated in Fig. 5, a HVN is composed of three main 
components, namely a Radio Access Control (RAN), A Core 
Network (CN), and a Service Center (SC). Service providers 
can often supply a variety of services to vehicular users through 
the SC. The CN is a key component of the HVN because it 
provides many important functions, such as aggregation, 
authentication, switching and so on. 

Authors in [4] present an overview of integration of 
VANET and WiMAX. Architecture of VANET based on 
WiMAX consists of several logical network entities including 
subscriber station (SS) or Mobile Station (MS), Access Service 
Network (ASN) and Connectivity Service Network (CSN) [9], 
[10]. As it is illustrated in Fig. 6, the SS is for fixed device 
terminal and it is not required to support handover capability. 
The MS providing handover function is installed or embedded 
in car for VANET and it should support handover. ASN is a set 
of network functions to provide wireless connection and 
WiMAX system profile. These functions are including media 
access control for MS, transfer of authentication, authorization 
and accounting (AAA) messages by RADIUS or diameter 
preferred network discovery and selection, radio resource 
management and IP connectivity. 


Figure 6. VANET architecture with mobile WiMAX 

ASN is composed of BS and ASN gateway which connects 
several BSs based on cell planning. Local server is required for 
VANETs applications. The local server processes collected 
information from the MSs in vehicles and sends warning 
messages to MSs [4]. The message type depends on features, 
dangers of collisions, accident information and so on. CSN is a 
set of network functions that provide IP connectivity service to 
MS. CSN comprise network elements such as router, gateway 
for internetworking and various kind of servers. These servers 
are including DHCP for IP address allocation, AAA 
proxy/server, user database, home agent for mobility 
management, central server for VANET application and so on 
[11]. 

III. QUALITY OF SERVICE 

Quality of Service (QoS) is the ability of a network to 
provide improved service to selected network traffic over 
various underlying technologies, including frame relay, ATM, 
Ethernet, SONET, and IP -routed networks and it offers 
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flexibility, scalability, efficiency, adaptability, software 
reusability, and maintainability. QoS is also defined as a set of 
service requirements that needs to be met by the network while 
transporting a packet stream from a source to its destination 
[12], in fact it is the measure of how good a service is as 
presented to the user [13]. QoS provisioning often requires 
negotiation between host and network, call admission control, 
resource reservation, and priority scheduling of packets [14]. 
QoS can be rendered in network thorough several ways: per 
flow, per link, or per node [14]. Characteristics of network such 
as lack of central coordination, mobility of hosts, and limited 
availability of resources make QoS provisioning very 
challenging [15]. In particular, QoS features provide improved 
and more predictable network service by providing the 
following services [16]: 

• Supporting dedicated bandwidth 

• Improving loss characteristics 

• Avoiding and managing network congestion 


exceeded a specified number or rate. For example, when 
streaming video frames, after certain number of lost 
frames, the video streaming may become useless, this 
number may be zero in certain cases. Therefore, certain 
guarantees on the number of rate of lost packets may be 
required by certain applications for QoS to be considered. 
Packet loss can occur because of packet drops at 
congestion points when the number of packets arriving 
significantly exceeds the size of the queue. Corrupt 
packets on the transmission wire can also cause packet loss 

[17]. 

There are numerous levels of QoS and those levels have 
been grouped into three main categories: 

• Best Effort Services : Best Effort is a single service model 
in which an application sends data whenever it must, in 
any quantity and without requesting permission or first 
informing the network. For best-effort services, the 
network delivers data if it can, without any assurance of 
reliability, delay bounds, or throughput [16]. 


• Shaping network traffic 

• Setting traffic priorities across the network 

As it is mentioned, QoS is quantitatively defined in terms of 
guarantees or bounds on certain network performance 
parameters. The most common performance parameters are the 
bandwidth, packet delay, jitter, and packet loss [17]: 

• Bandwidth: The term bandwidth defines the transmission 
capacity of an electronic line. Theoretically, it describes 
the range of possible transmission rates, or frequencies. In 
practice, it describes the size of the pipe that an application 
program needs in order to communicate over the network. 
The significance of a channel bandwidth is that it 
determines the channel capacity, which is the maximum 
information rate that can be transmitted. 

• Delay : Network delay is an important performance 
characteristic of a computer or telecommunication 
network. The delay of a network specifies how long it 
takes for a bit of data to travel across the network from one 
node or endpoint to another. It is typically measured in 
multiples or fractions of seconds. Delay may differ 
slightly, depending on the location of the specific pair of 
communicating nodes. Although users only care about the 
total delay of a network, engineers need to perform precise 
measurements. Thus, engineers usually report both the 
maximum and average delay, and they divide the delay 
into several parts; propagation delay, transmission delay, 
queuing delay and processing delay. 

• Jitter. Jitter is defined as a variation in delay of received 
packets. The sending side transmits packets in continues 
stream and spaces them evenly apart. Because of network 
congestion, improper queuing, or configuration errors, the 
delay between packets can vary instead of remaining 
constant [18]. 

• Packet loss : Packet loss is another important QoS 
performance measure. Some applications may not function 
properly, or may not function at all, if the packet loss 


• Integrated Services : Integrated services is a multiple 
service model that can accommodate multiple QoS 
requirements. In this model the application requests a 
specific kind of service from the network before it sends 
data. The request is made by explicit signaling; the 
application informs the network of its traffic profile and 
requests a particular kind of service that can encompass 
its bandwidth and delay requirements. The application is 
expected to send data only after it gets a confirmation 
from the network. It is also expected to send data that 
lies within its described traffic profile. [16]. 

• Differentiated Services : In this QoS level, no absolute 
guarantees are given. Rather, different priorities are 
assigned to different tasks. Hence, applications are 
grouped into different classes of priorities. Many 
application traffics work very well with this policy when 
absolute guarantees are not needed. For example, 
network control traffic should always be given higher 
priority over other data communications to ensure the 
availability of, at least, the basic connectivity and 
functionality at all times [17]. 

Providing QoS support in ad-hoc networks is a dynamic 
research area. These networks have certain inimitable 
characteristics that facade several intricacy in QoS provisioning. 
The characteristics that affect QoS provisioning in these 
networks are: dynamic varying network topology, inaccurate 
state information, lack of central coordination, error prone 
shared radio channel, hidden terminal problem, limited resource 
availability and insecure medium [14]. There are approaches 
designed for QoS provisioning in MANETs but they are not 
suitable for VANETs, because they do not consider the high 
mobility constraints and large scale node population [19]. QoS 
parameters such as throughput, latency, jitter, and packet loss 
are key requirements in VANETs [20]. Each application in 
VANETs has its own requirements, for example; safety warning 
applications should have minimum End to End (E2E) delay, 
because if a warning message receives at destination with high 
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delay, that message could not be helpful for preventing an 
accident. Accordingly, packet loss and throughput are two other 
factors that are very important in active safety applications [13]. 


IV. Previous works 

A. Improving QoS in VANET Using MPLS 

Authors in [13] divide vehicular communications into two 
categories, Vehicular Ad-hoc Networks which includes V2I 
and V2V communications and Roadside Network which 
consists of Roadside Access Network (RAN) and Roadside 
Backbone Network (RBN). RBN represents the backbone 
network of RSUs, in which RSUs communicate with each 
other and with the internet [21]. They assumed that each 
vehicle is covered by a base station, which has its own domain 
of service, and base stations are connected with a wired 
network named RBN and then, they used MPLS in wired 
domain. MPLS is a forwarding method which can assign 
packets to different forwarding equivalent class (FEC) for 
receiving the required service from the network to support 
QoS. MPLS is considered as layer 2.5 protocol [21] and it is 
compatible with any layer 2 technology, like Ethernet and 
ATM. They also used AODV as a wireless ad-hoc routing 
protocol, because AODV imposes less overhead to the 
network. Finally they used SUMO [22] to design Manhattan 
mobility model and then they exported the output of SUMO to 
NS 2. 34 for the main test. Results show that with the help of the 
proposed idea in [13], better performances in terms of E2E 
delay, packet loss and throughput is achieved. 



Figure 7. Vehicular Communication Pattern in [13] 
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B. Utilizing Mobile IP and MPLS to Improve QoS in VANET 

Mobile IP is the current standard for supporting IP 
mobility of mobile nodes in wireless networks with 
infrastructure [23]. Mobile IP enables the mobile node to 
access internet and changes its access point without losing the 
connection [23]. Mobile node (MN), Home Agent (HA), 
Foreign Agent (FA) and Care-of-Address (CoA) are main 
components of Mobile IP. When the MN moves away from 
HA to the foreign network, a CoA is assigned to it in order to 
inform the HA of its current location. This operation enables 
MN to send and receive at any location without going through 
HA [24]. Authors in [24] used Mobile IP, MPLS based 
backbone and AODV routing protocol to improve the QoS in 
VANET. They used city which was simulated in [13] with 
SUMO [25] and then exported the outputs of SUMO to 
NS. 2. 34 to implement the communication network. Their 
results show that using Mobile IP (in comparison to the 
proposed idea in [13]) doesn’t have positive effect on delay 
but, better performances in terms of packet loss and throughput 
are achieved. 
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C. Improving QoS in VANETs by detecting and Removing 

Unused Messages 

Authors in [26] tried to increase the performance of 
VANETs by removing the useless or unused packets. They 
considered the following scenarios: 

Scenario 1: consider a highway that has at least two lines 
for car traffic (Fig. 14). Suppose that car 1 brake abruptly. In 
this vehicle, emergency electronic brake light application sends 
a message in its area. In this way other vehicles that receive the 
message must have a proper reaction. Vehicles that are in the 
same line and are behind the carl - such as 4 and 5 - after 
receiving and processing of the received message from car 1 
they must reduce their speed [26]. Although car 3, 6, 7, 8 and 2 
receive these messages and after receiving the safety message 
they can remove it. In this special safety application, the 
position of vehicles has influential effect on their reactions 
[26]. According to this scenario if car 3 brakes and sends a 
safety message, car 1, 4 and other cars receive this message, 
but according to their position they do not have to do any 
reaction. So all cars which receive this message do not need to 
process it and without any processing they can drop it. If we do 
not have this idea, each car which receives the safety message 
should process it and according to the type of that message, 
each car should do a reaction [26]. 
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Figure 14. Impact of vehicles position [26]. 

Scenario 2: In this scenario as shown in Fig. 15, suppose 
that car 1 brakes abruptly and sends a safety message over its 
area. 



Figure 15. Impact of distance between vehicles [26] 

Each car which receives the sent message will be forced to 
react and send a safety message according to its condition. If 
we review the scenario, we will see that the received safety 
message for vehicles far from the source, vehicle such as 4 and 
5, is less important that closer ones [26]. In this scenario all of 
the cars are in the same lane and according to the previous 
scenario, all of them must process the message after receiving 
and then show a proper reaction according to the type of the 
received message [26]. But we know that when car 1 braked, 
car 2 which is the nearest car behind it, must react quickly. Car 
3 which is so far away from car 1 does not need to do any 
reaction because of its distance to car 1. In this idea each 
vehicle must be able to compute the distance between itself and 
another [26]. 

Simulation results show that with the help of the proposed 
idea in [26], better performance in terms of Message Expiration 
Ratio is achieved. 



Figure 16. Simulation result before applying the proposed idea in [26] 
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Figure 17. Simulation result after applying the proposed idea in [26] 

V. Proposed architecture and protocol stack 

A. Proposed Architecture for Vehicular Networks 

In this section we are going to introduce our proposed 
architecture for vehicular networks. As it is illustrated in Fig. 
18, in our proposed architecture, geographical regions are 
divided into 25 unique areas and in each area there are 9 
WiMAX base stations which provide wireless 
services to the vehicles and they are connected together with a 
wired network. These WiMAX base stations operate as a 
wireless switch for in-cell communications and a gateway for 
out-of-cell communications. Cars communications is also 
restricted, each car could communicate only with other cars 
and base stations in other 24 areas around it (Fig. 18). 
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B. Proposed Layer 3 Protocol 

VCNP (Vehicular Communication Network Protocol) is our 
proposed layer 3 protocol for vehicular communications. 
VCNP header is illustrated in Fig. 20. There are some 
differences between VCNP and Internet Protocol (IP). As we 
know, there are four octets for each of source and destination 
address fields in IP but in VCNP we propose to use 3 octets 
instead of four . The first octet represents the area, the 
second octet represents the base station and the third octet 
represents the vehicle, so any node will have a unique layer 3 
address and according to the restricted communication 
domain, we will be able to reuse layer 3 addresses several 
times in other areas. There is a one-bit field, M, which shows 
the last packet of the stream, whenever M is set to 1 , it shows 
that there are more packets to come and when M is set to 0, it 
means that the stream is finished. We also eliminated the 
Fragment Offset and Flag fields, because according to layer 2 
technologies and Maximum Segment Size (MSS) we could 
estimate a constant size for layer 3 packets, therefore routers 
do not have to fragment packets and both header and packet 
size will be constant. Version field is also eliminated and the 
QoS field is reduced to 3 bits. Other fields of VCNP are the 
same as IP header fields. 


IHL (4) 

QoS 

0) 

M 

U) 

Total length (16) 

ID (16) 

Protocol (8) 

TIL (8) 

Checksum (16) 

Source Address (24) 

Destination Address (24) 


Figure 18. Regions are divided into 25 areas 


Fig. 20. VCNP header 
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C. Proposed Layer 4 Protocol 

Vehicular Communication Transport Protocol (VCTP) is our 
proposed transport protocol for vehicular communications. 
TCP and UDP have their own advantages and disadvantages. 
According to special characteristics of VANETs, we propose a 
new transport protocol aiming to provide better rates in terms 
of throughput, delay and packet loss. 

TCP is a connection-oriented protocol and is suited for 
applications that require high reliability and less transmission 
rate. TCP is also a heavy-weight protocol and does error 
checking, handshaking and flow control, therefore it is not a 
suitable transport protocol for VANETs with those mentioned 
characteristics. On the other hand, UDP is suitable for 
applications that need fast and efficient transmission. UDP is 
faster because it is a light-weight protocol and there is no error 
checking, flow control, etc. and it is a small transport layer 
designed on top of IP. 

In this paper we propose a new transport protocol aiming to 
provide the high throughput of UDP, as well as the packet loss 
rate of the TCP. VCTP is similar to UDP in terms of source 
port, destination port and checksum fields but in contrast to 
UDP, there is a recovery option as well as a handshaking 
process. VCTP also guarantees that the sent packets will reach 
to destination. VCTP header is illustrated in Fig. 21. 


Source Port 
(16) 

Destination 
Port (16) 

Seq #(36) 

Syn 

(2) 

Fin 

(2) 

Checksum (16) 


Figure 21 . VCTP header 


D. VCTP Algorithm 

Application layer sends the produced data to transport layer 
and then according to MSS, transport layer divides the whole 
data into segments and sends them to the destination. We 
assume that application layer produces some data and 
transport layer wants to send these data in 1000 segments. 
VCTP operates as follows: 

1- At first, source sends a segment to the destination, in this 
segment “Syn” is set to “01” and “Seq#” is set to 
“01111101000” or “1000” in decimal system. It shows that 
source wants to establish a connection and send 1000 
segments. 

2- If destination is ready for data exchange, it will send a 
segment to source. In this segment “Syn” is set to “10” and 
“Seq #” is set to “01111101000”. It shows that destination is 
ready to exchange data and knows that 1000 segments will be 
sent (Fig. 22). 

2- 1- If destination did not catch the segment that was sent in 
part 1, after a while, source sends that again. 

3 - Source starts to send data, when each segment is sent, the 
“Seq#” will be increased in the next segment. For example in 
the first segment, “Seq#” is set to “01”, and in the second 
segment “Seq #” is set to “10”. 
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DP- Das fin a do d Port 

Figure 22. Handshaking Process 

4- When source finishes sending data, it sends a segment to 
destination. In this segment, “Fin” is set to “01” and it shows 
that source has finished sending data. 

5- If the destination node, receive this segment: 

5-1- Destination will check if it has all one thousand segments. 
If yes, a segment will be sent to the source by destination and 
in this segment “Fin” is set to “11” and it shows that 
destination has received all the 1000 segments and is ready to 
finish the connection (Fig. 23). 

Source Destination 


SP DP Seq# Sjn Fin cluarksTim 



Figure 23. Sending data and finishing process 


5-2- If the destination node finds that some segments are lost, 
it will send some segments to inform the source about the lost 
segments. In these segments “Fin” is set to “10”. For example 
if destination do not receive #200 and #201, it will send two 
segments to source, in both of them “Fin” is set to “10” but 
“Seq#” in the first one is set to “11001000” and in the second 
one is set to “11001001”. It means that, destination has not 
received segments with mentioned sequence numbers. 

5-3- Source will send #200 and #201 segments to the 
destination immediately, and repeats the finishing process 
which is explained in part 4 (Fig. 24). 
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Figure 24. Retransmission Process 

-If the destination do not receive the sent segment in part 4, 
after a period of time, source will send it again. 


In the simulation scenario, there are 20 vehicles and 4 base 
stations. Vehicles communicate with each other via base 
stations, so the communication type is Vehicle to Roadside to 
Vehicle or V2R2V. 

After the simulation process we compared the achieved 
results with another scenario in which TCP/IP protocol stack 
was used. Simulation results show that better performance in 
terms of throughput is achieved. Packet loss rate and delay are 
also improved. 

Wsec Throughput 

160000 
140000 
120000 
100000 
80000 
60000 
40000 
20000 
0 

Proposed Protocol Stack TCP/IP 

Figure 25. Throughput 

?aaax Packet Loss 



E. Simulation Results 

We used OMNET++ to simulate our proposed idea. 
OMNET++ is an open-source, component-based simulation 
package built on C++ foundations. Simulation parameters are 
depicted in table 1 . 

Table 1 : Simulation Parameters 


Network Area 

1000* 1000 m 

Channel Type 

Wireless 

Radio Range 

500 m 

Traffic Type 

CBR 

Visualization 

OMNET++ 

MAC 

IEEE 802.16 

Routing 

Static 

Number of Vehicles 

20 

Number of Base Stations 

4 

Vehicles Speed 

40-80 Km/h 

Packet Size 

1000 

Transport Protocol 

VCTP 

Duration 

60s 

Radio Propagation 

Two Ray Ground 

Queue Type 

Drop Tail 

Addressing Type 

Hierarchical 3 level 


0.1 
0.09 
0.06 
0.07 
0.06 
0.05 
0.04 
0.03 
0.02 
001 
0 

Proposed Protocol Stack TCP/IP 

Figure 26. Packet Loss 
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Figure 27. Delay 
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VI. Conclusion 

In this paper we presented a short overview of vehicular 
networks architecture, QoS concepts and QoS provisioning in 
vehicular communications. We proposed a novel architecture 
and protocol stack, aiming to improve QoS and security in 
vehicular networks. Finally we simulated our proposed idea 
and compared the achieved results with a similar scenario in 
which TCP/IP protocol stack was used. Results show that our 
proposed protocols provide better rates in terms of delay, 
packet loss and throughput. The type of communication that 
was used in our simulation was Vehicle to Roadside to Vehicle 
(V2R2V). Moreover, we did not use any specific routing 
protocol. Therefore, in the future works a routing protocol will 
be used and we will implement our proposed idea on other 
types of vehicular communications like V2V and V2I 
communications . 
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Abstract — Transition from IPv4 to IPv6 is a cumbrous process 
because of their irreconcilability with each other and coexists 
during the transition period. This work examines the behavior of 
transition mechanisms that involve communication among IPv4 
and IPv6 in various scenarios and traffic conditions. A network 
analyst faces variable traffic and data rates at different nodes in 
such a heterogeneous network, that requires more attention to 
make it able to work with stable network flow and data rate. We 
analyse an end-to-end delay of VOIP data packets in IPv4 and 
IPv6 homogeneous and heterogeneous networks using 6 to 4 
tunneling techniques. This work shows that IPv6 has better 
performance than IPv4 and IPv6-to-IPv4 tunneling. The 
tunneling technique improves the network throughput and 
queuing delay over the intermediate nodes of the heterogeneous 
network. 

Keywords: IPv4, IPv6, VoIP, 6- to-4 tunneling, DSTM 
I. Introduction 

Routing devices are needed for traffic exchange in 
interconnected networks. In case of dynamic routing the router 
makes its tables by broadcasting informative messages. 
Routing protocols determine the shortest path to destination. 
Based on traffic and routes availability, the routers are updated 
accordingly. 

Internet Protocol shortly IP is a transmission technique for 
data on internet. Its current address space is 32 bit. Often other 
protocols are used to complement with it in making sure that 
data has been transferred to its required destination, as every 
device is uniquely identified by IP-address in a network. IP is 
a connectionless protocol and is not concerned with the 
delivery and order of data. Also it doesn’t give any 
information about packet loss during transmission. In future 
internet will face a problem of limited addresses and no new 
host will get a chance to be connected with the internet. The 
increasing usage of internet through different devices i.e. 
mobiles, PC, tablets etc. require large number of IP addresses. 
IPv6 not only replaces IPv4 to achieve a large address space of 
128 bits, but also provides extra facilities like high security, 
QOS, Mobility, Simple header formats. The new version of 
Internet protocol i.e. IPv6 is getting importance due to 
scalability, multimedia transmission and elimination of NAT 
requirement. Most of the networks are based either IPv4 or 


IPv6 and connecting both the networks is the hard issue of 
today’s communication systems. 

Section II discusses the objectives of the research. Section III 
discusses the proposed and its analysis. Section IV summaries 
and concludes this work. 

II. Literature Review 

The digital world is switching over rapidly from IPv4 to IPv6 
due to the shortage of IPv4 addresses, huge routing tables, 
security issues, mobility and Quality of Service that over the 
Internet. High speed networks and extra IP addresses are 
required to every person to become a part of the globally 
connected network. New protocols are designed to fulfil the 
end user demands and face new challenges of the digitally 
communicated world. Internet protocol IPv6 has the improved 
features over IPv4 that overcome IP address shortage and 
numerous IP addresses are available to assign them to each 
individual node [1]. A number of applications still support 
IPv4 only and require communicating with other applications 
over IPv6 enabled networks. 

Abrupt migration from IPv4 to IPv6 is not possible and still it 
may take years to completely replace IPv4 over the Internet. 
Various migration techniques like Dual Stack, Translation and 
Tunneling Mechanisms are used to make IPv4 functional with 
IPv6. IPv4 is compared with IPv6 in [4] which shows that in 
low traffic load IPv4 perform better than IPv6. The impact of 
IPv6 transition mechanism is discussed in [6] which show that 
the performance overhead is minimal but the translation 
packet degrades its performance. VoIP is compared on LAN 
using Background UDP that shows IPv6 has more packet loss 
than IPv4 in high congestion and have poor voice quality [8]. 
Dual Stack Transition Mechanism(DSTM) provide better 
reliability and low data loss as compared to IPv4 having a long 
queue delay due to encapsulation and de-capsulation overhead 
at the end points of the channel. 

Combining Tunneling and Dual Stack Mechanism improves 
reliability and reduce packet loss. Increasing the packet size 
(>1000 bytes) the queuing delay increases which results in 
overall low through put in DSTM, in that case IPv4 networks 
performance is better. The transition 6 over 4 mechanisms 
using IPv4 multicast tunneling and their constraints are 
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discussed in [7]. This mechanism is suitable for small 
networks but having scalability issues in large networks. 
Internet uses both IPv4 and IPv6 addresses which require 
frequent translation of the IP addresses wherever required. 
NAT-PT translates IPv6 addresses to IPv4 [3]. NAT-PT use 


Scenario 

Point-to-Point 

Throughput 

(Packets/sec) 

Point-to-Point 
Queuing Delay 

Packet End-to- 
End Delay 

IPv4 

41.7 

0.000017628 

0.060191 

IPv6 

114.51 

0.00001401 

0.06009 

6-to-4 

Tunneling 

123.06 

0.00001387 

0.060185 


different ports for the hosts which increase its limit to 63K 
hosts to overcome the IPv4 address shortage. NAT-PT is 
limited to TCP and UDP connections and does not support 
DNS and FTP that has been resolved by using Application 
Level Gateway (ALP). The VoIP data traffic on FTP using 
translation technique face problems highlighted in [3] and 
degrades integrated network performance [6]. 

In the first phase of tunneling networks use IPv4 protocol and 
small island of IPv6. In second phase IPv4 is encapsulated 
with IPv6 tunnel that migrate IPv4-to-IPv6, which makes IPv6 
network able to communicate with IPv4 networks. Various 
tunneling techniques are discussed in [5]. 



Figure 1: Tunneling Techniques used in IPv4-to-IPv6 Networks. 
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III. Network Architecture of IPv 4 & IPv6 network 
USING OPNET 

Two 6-to-4 sites A and C are simulated in the scenario along 
with a relay site B. There are two 6-to-4 sites in the network, 
Site A and Site C and are connected to IPv4 backbone. Router 
A and C have defaults routes to site B using 6-to-4 tunnels. 
Any packet from A or C (6-to-4 sites) to site D (IPv6 site) is 
first tunnelled to site B router by the use of 6-to-4 tunnel and 
then sent to Site D. 

Router in site D IPv6 network has static route to destination 
(2002:: /16) for which next hop is set to Router B. Any packet 
destined for A or C (6-to-4 sites) is sent to Router B and 
tunnels the packet to destination. 



Figure 2: IPv6-to-IPv4 Tunneling Scenario 




VoIP has better performance over IPv6 networks than IPv4 
networks [9]. Three parameters delay, packet loss and overall 
throughput is observed in IPv4, IPv6 60 to 4 tunneling, 
resulting IPv6 < 6 to 4 tunneling < IPv4, IPv6 < 6 to 4 
tunneling < IPv4 and 6 to 4 tunneling < IPv6 < IPv4 
respectively [10]. They consider text and graphical data and do 
not focus on voice data traffic. Voice data analysis for the 
aforementioned parameters is still required to analyze over 
various types of networks, which reflects the end-users in real- 
time. This work presents a comparative analysis of IPv4, IPv6 
and 6-to-4 tunneling networks. 

This paper presents a comparative study of the behavior of 
IPv4-only network with IPv4 integrated with IPV6 networks 
using 6-to-4 mechanism. The performance metric is mean end- 
to-end delay for both the cases because in the previous 
research papers, other metrics such as latency, throughput, 
CPU utilization and Loss Rate analysis are carried out but the 
researchers give a little exposure to the mean end-to-end delay 
so this is the motivating force behind our work. Comparison of 
VoIP performance will be analyzed on IPv6 and IPv4 LANs in 
the presence of varying levels of background traffic. 


IV. Performance Evaluation 

The model is evaluated from different angles; their results are 
collected at different levels. Here we analysed collected 
information in different ways. 

4.1 Throughput, Queuing Delay and End to End delay 

Table 1: Results collected from the Designated Model at Different Modes 

Table 1 shows that the throughput of 6 to 4 tunneling is three 
times to IPV4 and almost similar with IPV6. Tunneling 
increases the throughput and it is obvious that Queuing delay 
for workstation ‘B’ in 6-to-4 tunneling router is least delayed 
because it is connected to a relay router. Workstation ‘D’ in 
IPv6 environment which is connected to IPv6 backbone has 
the second lowest delay as compared to other two stations i.e. 
workstations ‘A’ and ‘C’ which is connected to IPv4 
backbone. IPv6 has least delay as compared to IPv4 and hence 
IPv6 perform better than IPv4. Relay router further reduces the 
queuing delay in 6-to-4 tunneling network. Packet end-to-end 
delay and throughout is stable and well in limits. 
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Workstations connected to their respective routers observe 
similar results are graphically represented in Figure 3, 4 and 5. 
Workstations connected to IPv6 networks has better 
throughput utilization as compared to the workstations 
connected directly to IPv4 networks as can be seen in the 
given figure. 


gj time_average (in point -to-point.utilization) 



Figure 3: Utilization 

S3 point-to -point. throughput (packets/sec) I <=> ll~[=) l kaE3j 



Figure 4: Routers (Point-to-point throughput) 

S3 time_average (in point -to-point.throughput (packets/se... I o II B 

■ Object: wkstn C <-> Router C [0] of Office Network <— 

■ Object: wkstn D <-> Router D [0] of Office Network <— 

time_average On point-to-point. throughput (packets/sec)) 



Figure 5: workstation (Point-to-Point throughput) 

From above graphs workstation C has much lower throughput 
then workstation D (three times lower), but the queuing delay 
for D is same as station C. The reason for same queuing delay 
for C and D is that they are inside their networks. 

4.2 Investigating Relay Router 

From above results since relay router is producing excellent 
results for throughput and queuing delay, it is important to 
further investigate this router. Throughput and queuing delay 
of relay router to IPv6 router is better than towards IPv4 
router. The point to point utilization between the IPV6 
network and IPv4 network is same. 



Figu re 6: Point-to-Point throughput 

S3 point-to-point. utilization I <=■ II a ll^ r^i 



Figure 7: Point-to-Point utilization 



4.3 Voice calls made to workstation D 

Now let’s break up the packet delay variation graph for voice 
calls made to workstation D. 

The packet delay variations for voice calls made to 
workstation D from workstation A, B and C. 


S3 time_average (in Voice Calling Party.Packet Delay Variation) r^~ii b i nrej 
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Figure 10: Voice Application packet delay variation 
It is obvious that the packet delay variations for voice call 
coming from IPv4 networks (i.e. from workstation A and C) 
are much higher. Since Workstation B is connected to a relay 
router and making call to D station in IPv6 environment, it has 
the least packet delay variations. 


Workstation VoIP call 

Packet End-to-End Delay (sec) 

C to D 

0.060191 

A to D 

0.060185 

B to D 

0.06009 


Table 2: End-to-End Delay (sec) in Voice Calls 


We observe the overall network jitter and packet delay 
variation. Results show that the network has presented an 
acceptable jitter and packet delay variations for VoIP as per 
international standards. 

Ea Voice. Jitter (sec) [~^~| fWI lii^iiil 


Voice. Jitter (sec) 
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We analyze the performance of IPv4-only and IPv4/IPv6 
integrated networks by using the framework of the 6-to-4 
tunneling. The behavior of IPv4 and integrated IPv4/IPv6 is 
analyzed by different angles i.e. throughput, queuing delay, 
jitter and mean end-to-end delay. The 6-to-4 tunneling better 
performance than IPv4 networks in all these tests and the 
overall end-to-end delay is reduced to a significant level in 
heterogeneous network. Other tests on jitter and packet end-to- 
end delay prove that IPv6 has better performance than IPv4 
enabled networks. 
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Figure 11: Voice Jitter 



Figure 12: (voice packet delay variation) 

Figure 11, 12 shows that jitter becomes almost predictable and 
stable gradually when a load is applied over the networks. 

V. Conclusion 
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Abstract — Today as Android is used by majority of the 
smartphone users it has become one of the effortless platform 
for the malware-writers to introduce their malicious activities 
into smartphone world through this android mobile applications. 
The main loophole in Android applications is permission based 
security control. The User unawareness of accepting every 
permission as a mandatory requirement by an app is making 
more and more convenient for the hackers to extract the users 
private data. In this paper we have analysed all the leakages 
which are done by using permissions required by an app. We 
carefully made an investigation to detect collusion attacks .We 
analyzed the present detection methods of inter-permission leaks 
especially on Collusion attacks and mentioned the areas where 
the enhancements are needed with limitations that existed in 
present detection methods. 

Keywords- Collusion attacks, inter-permission leaks 

I. Introduction 

The Excaberation of smartphones has extended the use of 
mobile apps .At around 2.6 billion users are using the 
smartphones and as per analysis and estimation of various 
surveys the mobile usage may raise by 6.1 billions by 2020[5 
].As a result there is an instantaneous increase of mobile apps 
in different app play stores. At present Google play store has 
1.6 million android apps and AppStore of Apple has 1.5 


million apps [4] [5 ]. All the Android apps follow the classical 
and traditional permission based access control as a 
centralized control mechanism. For an app to be installed into 
a Smartphone user should accept all of the required 
permissions mandatorily, this mandatory acceptance of each 
and every permission creating an opportunity to intruders to 
introduce the malware. As per statistics given by appvigil 
analysis reports [16] 98% of the present apps in different app 
stores are pregnable as these apps require the permissions 
which are inexpedient to the app functionality. This 
permission adoption is giving an entry to the intruders to 
launch intrusions through mobile apps by which most of the 
private user data is released. 

This immensive explosive growth of apps became a channel 
for the introduction of many types of attacks by which most of 
the mobile users private content is revealed. The intruders 
find the inadequacy in the accepted permissions in the existing 
apps or with their own strategies they are developing their 
own apps with permissions insignificance to the apps 
functionality and finally the user’s sensitive data from the 
Smartphone’s is extracted. This type of problem is pointed as 
data Leakage done by using permissions or “Permission 
leakage “ which is one of the most dangerous attacks by which 
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legally extraction the Smartphone user’s private/sensitive 
information is done. 

The Permission leakage attacks [2] are of three types known 
as Confused deputy attacks, Intent Spoofing and Permission 
collusion. Confused deputy attacks completely depend on 
misconfigured applications. Intent spoofing [2] is a form of 
confused deputy attack which affects applications that are not 
meant to communicate with other applications. Collusion 
attacks [2] [10] uses overt and covert channels and aggregates 
the permissions from different apps and releases user sensitive 
data. These type of collusions attacks are difficult to detect 
and causing a great deal of research in the mobile applications. 
We analyzed some of the methods that detects the colluding 
applications and we enhanced the present detecting models of 
colluding applications. 

There are many attacks through apps of which collusion 
attacks , confused deputy attacks and intent spoofing attacks 
are the permission leakage attacks. At present there are 
different tools and methods to avoid these types of attacks but 
the existing classical permission model in smartphone needs to 
be intensified and give the user necessary indications to the 
user on the acceptance of each and every permission required 
by the app. 

To give appropriate directions to the mobile users on 
classical permission model we proposed a framework that 
investigates the app and its permissions and finally points out 
if the app requires a pertinent permissions based on its 
functionality or not. 

The rest of the paper is organized as .In section-2 we 
presented the problem of application collusion in smartphones 
along with examples. In Section-3 we presented the inference 
on collusion attacks .In Section-4 we presented the analysis of 
collusion attacks by using present detectors and their 
limitations. In Section-5 we presented the future scope in the 
detection of collusion attacks and finally we draw conclusions 
on the collusion attacks detection and scope. 


II. APPLICATION COLLUSION IN SMARTPHONES 

Colluding applications are those applications that collaborate 
in breach of some security policy of the system. These 
applications legally do not individually break any security 
permissions or pervert software vulnerabilities. They 
alternatively use existing channels or new channels are 
constructed for communication to perform malicious actions 
or try to access the unauthorized resources . 



Figure 1. The ContactsOptimizer application on the left and the News application 
on the right colluding through a covert communication channel. The 
ContactsOptimizer does not have access to the network, but has access to 
user’s contacts. The News application has no access to user’s contacts but can 
access the network. The ContactsOptimizer leaks the user’s contacts to the 
News application, which then sends this information to a third party. 


The attack of colluding applications is viable be-cause modern 
present security mechanisms are not focusing on controlling 
the channels where two applications can make a route to 
communicate. Rather, most of the efforts have been made to 
achieve application sandboxing or containment. This is most 
likely due to the fact that there is little concern on tight 
information flow control i.e with overt or covert channels 
typically of personal computers OSs, by which many 
smartphone operating systems are based. 
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A. covert and overt channels on smart phones 

Covert channel is a channel that is deliberately used by 
applications to make communication while it was not 
meant to be used for communication [2]. Covert 
Channels, uses non data objects to transfer information 
from one source application to another. Source 
manipulates system state, such as file lock or busy flag to 
signal information to the Sink. Sink is nothing but 
application that receives data. 

Overt channel is a channel by which application uses a data 
container such as a file or a buffet to hide information [2]. 
Overt Channels uses data objects which are normally 
used as data containers such as buffers, files, and I/O 
devices to transfer the communication from one source 
application to another. 

Overt channels could be controlled by strictly enforcing 
the access control policy while covert channels could be 
controlled by implementing dedicated methods. 

The Covert and Overt channels can be found at different 
levels of abstraction in a system which are as follows 
At the highest level, the level of API that an operating 
system provides the developers forms a channel as an 
example android uses java API this can be taken as the 
highest level ,this is the simplest level where the channels 
can be closed. At Intermediate level the OS is considered 
.This is the level of the operating system which is 
revealed by using native calls that will extract the data 
present in the operating system.This level can be closed 
but causes severe damages. At the lowest level hardware 
level which forms channels for exploitation using the 
smatphone hardware functionalities. This level is 
completely dependent on hardware functionalities and 
channels may not be closed easily. The Covert and Overt 
channels can be found at different levels of abstraction in 
a system which are as follows 

At the highest level, the level of API that an operating 


system provides the developers forms a channel as an 
example android uses java API this can be taken as the 
highest level ,this is the simplest level where the channels 
can be closed. At Intermediate level the OS is considered 
.This is the level of the operating system which is 
revealed by using native calls that will extract the data 
present in the operating system.This level can be closed 
but caus^ 
level w ■ 
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The Figure2 shows the concept of the Broadcast Intent 
overt channel where the sink is nothing but class of 
applications that has access to private data, by using the 
data objects in this overt channel data is transmitted from 
one application to other application as shown in this 
figure 
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Figure 2. The Figure showing visualization of the Broadcast 
Intent overt channel. 


Conled ctxt 

Intent j = new Intent fcol(udjng-id"}; 
contacts", contacts); 
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Figure3. The above figure showing the java code that is 
required for communication between source and sink in which 
the communication contains the private data. 


D. Example of Covert channel 


The Figure4 shows the Java code needed to exchange 
communication through the number of running services. By 
using this method the sink counts the number of services that 
the source application is using and will infer information from 
that , the source for a given amount of time t, either 
precipitate an extra service or not. The Sink requires one 
more extra permission i.e GET_TASKS to utilize the Java API 
and to list the running service 



Figured- . The Figure showing concept of process enumeration 
covert channel 

Processes Enumeration from native code: Figure 5 shows the 
concept of how the sink finds the no. of forked processes of 
source using the native (C) code. In this covert channel the 
sink application counts about how many processes have been 
forked by parsing the /proc/ file system , source application 
forks its process from a native JNI function by fork() call, A 
similar type approach can be used by using pthread_create() 
from the native code as a substitute to forking the process on 
the source side. In this case sink could read the file 
/proc/<PID>/status to count the no. of spawned threads. This 
allows information to be ex-changed between the sink and 
source applications 

I men 1 svc = new Intern t(^h js ■ - elas 3 J ;s la 

irl 5 e rv | c e { svc ) ; 

^topSeirv jjc e{ svc ) ; 

lh i= 

■ . ACTIVITY SERVICE^ 

tor : 5 STVijc.ES} { 

if {rsj pro ces 3 e C P*c It age . jcQ ) { 

I FAG,. "Found : " + r s j . £ r o ce s ; 

} 

} 


Figure5. This above figure showing the implementation of 
code of process enumeration channel used by sink and source 

III. INFERENCE ON COLLUSION ATTACKS 

Most of the collusion attacks are done using the intent- filters. 
Intent filter mechanisms does not provide any guarantee of 
security so it is considered as loose binding between Activity 
(components built for interfaces of mobile devices) and 
intents. Intents are used for communication between activities 
of same application or for inter- application communication 
[2]. In the application manifest which is Android XML 
manifest file the components of application and their 
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encapsulation and permissions required are specified. Most of 
the developers by mistake or by knowing specify the intent 
filters in their way which leads to permission leakage attacks 
like collusion attacks. Considering two points (i)If the Intent- 
filter specified is declared and exported attribute is not set to 
true or false, default value is true which makes the Activity to 
be accessible by any application ,in the same way(ii) if an 
intent-filter is not declared ,the exported attribute is not set ,by 
default Activity accessible though intents whose source is 
from the same application. Until and unless the developer 
specifies sharedUid in the manifest file exception is not 
allowed in the above two points. 


IV. ANALYSIS OF PRESENT INTER-PERMISSION LEAK 
(COLLUSION ATTACK) DETECTORS 

The permissions leakage attacks detection and prevention in 
android mobiles is one of the most seriously considered 
research areas as the majority of the smartphone users are 
utilizing the services provided by android. Most of the 
contemporary methods intensify on detection and prevention 
of Collusion attacks by static and dynamic analysis detection 
.The analysis of different inter-permission leaks along with 
their limitations as follows 

A.MockDroid[13], familiarly known as modified version of 
the Android operating systems permits user to v mock' an 
application's access to a resource. The system of MockDroid 
aims on faking the information that is utilized by the 
application so that the user data not to be divulged and that 
forms a major security of users data. In case of Broadcast 
intents problem as per MockDroid specified in [13], “if the 
permission required to send a broadcast intent from a package 
is mocked”, the broad-cast intent is never sent; likewise, “if 
the permission required to receive a broadcast intent by a 
package is mocked, it is never received”. In this way 


Collusion attacks are protected using Mockdroid by giving 
notifications to user as “Mocks permissions for applications”. 
There are limitations in MockDroid as hiding the notifications 
for applications which use fullscreen and mocking the source 
used by a background service . 

B. Fire-Droid[18], familiarly known as a policy-based 
framework which makes use of interleaving system calls to 
implement security protocols. FireDroid introduces FireDroid 
Application Monitor(FDAM) by using ptrace() which is 
considered as policy tracing, so everytime when the target 
process executes a system call, the kernel suspends the target 
process and notifies the FDAM. FDAM maintains policy 
enforcement policy (PEP) within it and checks the required 
information of the target process execution of system call. 
PEP takes and forwards this information to policy decision 
point(PDP) that will retrieve policies from policy repository 
within FDAM. Finally based on the policy evaluation is done 
to kill the process or accept the process and this is notified to 
user to take his decision based on the evaluation from 
FDAM.In this way collusion attacks are detected using 
FireDroid. There are few limitations in FireDroid if the 
policies were not designed and implemented there could be 
allowance of applications to collude and moreover there is no 
proper steps or policies given to the applications which are 
developed by same developers as a result collusion may 
happen. 

C. TaintDroid[19], is used for dynamic taint tracking to 
identify the information flows that reach sinks. TaintDroid 
assumes that third-party applications downloaded, are not to 
be trusted and monitors in real-time how these kind of 
applications access and manipulate user’s private data.Still 
better handling in control flow is needed to further detect 
collusion attacks. 

D. ScanDroid[20] is the tool for android and can detect 
information flow violations but it is not extensible with new 
taint propagation rules as a result of that collusion attacks 
detection is not effectively monitored. 
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E.PermissionFlow [2] is a static analysis-based technique for 
automatic identification of permission-protected information 
sources in permission-based sy stems. PermissionFlow 
identifies APIs whose base execution leads to permission 
checking and tracks the flow of APIs. The limitations are 
PermisionFlow is unable to trace of malevolent attacks 
performed by the top Android Market applications; when most 
applications correctly configure internal Activitys by not 
supplying an exported="true" attribute or an intent- filter. As it 
fails to detect some of the traces of intent filters and sharedUid 
collusion attack detection is not effectively done by using 
PermissionFlow. 

There are several other tools which are used for detection of 
Collusion attacks but we considered the most effective 
methods which are track the information flow as it makes 
much easier for detection of interpermission leaks. 

V. FUTURE SCOPE 

Most of the present methods and tools track the flow and if it 
leads to permission leakage it informs the user that the app is 
malicious and recommends the users to uninstall the app .As 
Android in present versions has given an option of only 
removing a permission where by disabling the required 
permission the user can run the app so there is a scope to 
track the malicious permissions and notify the user that 
malicious permissions should be disabled. We have also 
analyzed that there are identifying risky permissions out of 
present 135-140 permissions is easier .There are several 
methods by which the risky permissions can be given ranking 
so that if high risky permissions can be asked at installation 
user can directly disable that option once it is installed. There 
is also a scope to use RecDroid tool[14] where user 
recommendations are used for ranking the permissions and 
also disabling the malicious permissions. There is also a scope 
to categorize apps and based on the categorization the best 
possible permissions for a particular category of app should be 
given as per rating based on this user can know for which 


catregory of app which permissions should be treated as 
defaultly needed permissions and which permissions are asked 
extra and take a decision of disabling or enabling that extra 
permission asked other than specified in category relevant 
permissions. These are the different methods by which have 
scope to detect Collusion attacks. 

VI. Conclusion 

The substantial increase in the usage of Android mobiles has 
made the malware writers to choose this as a base platform for 
introducing the malware. The easiest way to introduce 
malware into smartphones is done by using the loophole of 
permission control. The attacks that are done through 
permissions are called as inter-permission leaks. In this paper 
we carefully analyzed how inter-permission leaks will happen 
and mentioned the inter-permission leaks ,as one of the inter- 
permission leaks of collusion attacks are still difficult to detect 
we analyzed how collusion attacks takes place ,with our 
analysis we mentioned the collusion attacks with examples. As 
per our analysis we took the different efficient existing tools 
which detects the inter-permission leaks ,we analyzed them 
and mentioned the areas where the existing tools lack 
detection the collusion attacks. Finally we draw to a 
conclusion that there is lot of scope for research in this area 
of detection of inter-permission leaks especially Collusion 
attacks as most of the android mobiles are releasing the private 
content of user. 
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Abstract — Requirements elicitation is the first and the most 
critical phase of Requirements Engineering (RE). Many 
techniques have been proposed to support the elicitation 
process. Each technique has its strengths and weaknesses. This 
variety makes the selection of technique or combination of 
techniques for a specific project a difficult task. Mostly 
techniques are selected based on personal preferences rather 
than on attributes of project, technique, and stakeholders. In 
this paper, the researchers propose a three- component 
approach for elicitation techniques selection. First, a literature 
review is conducted to identify the attributes affecting 
techniques selection and common elicitation techniques. 
Second, a multiple regression model is built to analyze these 
attributes in order to find the critical attributes influencing 
techniques selection. Finally, an Artificial Neural Network 
(ANN) based model for selecting adequate elicitation 
techniques for a given project is proposed. The ANN model 
helps reduce the human involvements in this process. It was 
implemented using Neural Network Fitting Tool in MATLAB. 
The network has accuracy of 81%. The ANN model was 
empirically validated by conducting a case study in a software 
company. 

Keywords: Requirements Engineering, Requirements Elicitation, 
Multiple Regression Analysis, Neural Network. 

I. INTRODUCTION 

Requirements Engineering (RE) is the process of 
formulating, documenting and managing software 
requirements. RE process is composed of various sub phases: 
requirements elicitation, analysis, specification, validation 
and management [1]. Requirement Elicitation is the first and 
the most important phase in the process of requirement 
engineering. It is the process that deals with seeking, 
uncovering, achieving, and detailing requirements for 
computer based systems [2]. Most of the software projects 
fail just because of the problems of requirements elicitation 
process. 

Many surveys have been conducted to explore the 
projects failure statistics. A survey conducted by Standish 
Group showed that 13.1% of projects fail due to incomplete 
requirements and 8.8% of projects fail due to changing 
requirements. Another survey found that 12.7% out of 1027 
projects were successful and the main reason for the failure 
was unclear and imprecise requirements [3, 4]. According to 
these various surveys, it is clear that poor requirements 


elicitation process is the most critical factor for the failure of 
software projects. 

The success of requirement elicitation process depends 
mainly on knowing which requirement elicitation technique 
to apply to a particular project. There are a variety of 
elicitation techniques such as interviews, observation, 
brainstorming, etc [5]. Each technique has its strengths and 
weaknesses. This is due to the fact that there is no one 
technique that can satisfy all situations. They work best at 
different situations and problems. Moreover, they can work 
in a complementary manner where the weakness of one 
technique can be compensated by the strengths of some other 
techniques [6]. 

Using a variety of techniques ensures discovering most of 
the requirements, and thus leads to effective requirements 
elicitation process. This variety makes the selection of a 
techniques or a combination of techniques for a specific 
project a challenging issue. Selecting inappropriate 
techniques has negative effects on the quality of the elicited 
requirements. Mostly the selection of requirements elicitation 
techniques is based on personal preferences rather than on 
the basis of characteristics of project, technique and 
stakeholders. 

Software engineers tend to select a particular technique 
for any combination of the following reasons [7]: it is the 
only technique that they know; it is their favorite technique 
for all situations; they follow a methodology that specifies a 
particular technique; or they guess that the technique is 
effective in the current circumstances. This subjective 
decision can bias the elicitation process and decrease the 
quality of elicited requirements. It is clear from the above 
analysis that an efficient approach for requirement elicitation 
is required which can be helpful for the selection of 
elicitation techniques. 

In this paper, the researchers propose a three- component 
approach for elicitation techniques selection. First, a 
literature review is conducted to identify the contextual 
attributes that may affect techniques selection process and 
common elicitation techniques. Second, a multiple regression 
model is built to analyze these attributes in order to find the 
critical attributes influencing techniques selection. Finally, a 
neural network based model for elicitation techniques 
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selection process is proposed which reduces the human 
involvements in this process. 

The rest of the paper is organized as follows: Section 2 
provides a background overview about the main concepts 
related to the research topic. Section 3 presents the related 
work focusing on elicitation techniques selection. Section 4 
describes the proposed approach and its main components. 
Section 5 presents a case study to validate the proposed 
model. The last section concludes the paper with final 
remarks. 

II. Backboard overview 

This section consists of three parts. The first part presents 
the requirements elicitation process and its tasks. The second 
part provides an overview of artificial neural network. The 
final part gives an analysis of the multiple linear regression 
model. 

A. Requirements Elicitation Process and Its Tasks 

Requirements elicitation is one of the important phases in 
the RE process. It is the process of collecting the 
requirements from stakeholders using different techniques 
[8]. Requirements elicitation process includes tasks that must 
allow for communication, prioritization, negotiation, and 
collaboration with all the relevant stakeholders [9]. Typical 
tasks of this process can be grouped into five types as shown 
in Fig. 1 [10]. 


biological neural system [11]. ANN is composed of a 
network of interconnected processing units (known as 
neurons). Fig. 2 shows the neuron model. In this model, 
various inputs to the neuron are represented by x b x 2 , x 3 , 
...,x n . Each line that connects these inputs to the neuron is 
assigned a weight. These weights are represented by w b w 2 , 
w 3 , ...,w n [12]. 



Neurons receive the inputs and sum them. A neuron has a 
rule for calculating an output which is called the activation 
function. In general, there are three types of activation 
function: threshold function, piecewise linear function and 
Sigmoid function. The selection of activation functions is 
based on the types of problem to be solved by the network 
[13]. The total received input in a neuron can be calculated as 
follows: 



Figure 1 . Requirements Elicitation Process 

Our paper focuses on the task of selecting techniques for 
elicitation process because it is a difficult task for software 
engineer to decide which technique or combination of 
techniques is the most suitable for a given project. There are 
a range of techniques for eliciting requirements including 
interviews, surveys, Joint Application Development (JAD), 
prototyping, etc. 

B. Artificial Neural Networks (ANN) 

ANN is a system that is inspired by the way biological 
neural networks work. In other words, it is an imitation of 


/ = WjXj + w 2 x 2 + ••• + w n x n 



i= l 


( 1 ) 


Basically, all ANNs have a similar structure or topology. 
The structure of ANN typically has three layers: an input 
layer, one or more hidden layers and an output layer. The 
input layer receives data from outside the neural network. In 
the hidden layers, computation is done according to function 
provided. The output layer sends data out of the neural 
network [14]. 

C. Multiple Linear Regression 

Regression analysis is a standard statistical technique for 
determining the relationship between two or more variables 
which have reason and result relation. The simple regression 
analysis estimates the relationship between a dependent 
variable and one independent variable and formulates the 
linear relation equation between dependent and independent 
variable. Regression models with one dependent variable and 
more than one independent variable are called multiple 
regression analysis [15]. 

According to McClave and Benson [16], the multiple 
regression model, assuming that there are n independent 
variables, is formulated as follows: 


y = /?o+/?i*i +/?2*2+-" + p n x n + £ ( 2 ) 

In this model, y represents the dependent variable and 
x b x 2 , ...,x n are the independent variables, and /?; is the 
regression coefficient, and £ is the random error component. 
The value of the coefficient determines the contribution of 
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the independent variable x i? given that the other (n-1) 
independent variables are held constant and /? 0 is the y- 
intercept [17]. 

III. Related Work 

There are many studies conducted to describe elicitation 
techniques and provide some guidance on their use and some 
others comparing elicitation techniques. However, little 
research has focused on selecting the right technique or 
combination of techniques for a specific project. For 
example: 

• Carrizo, Dieste and Juristo [7] proposed a 

framework to help requirements engineers in 
selecting elicitation techniques at any time. To do 
this, they determined the contextual attributes of the 
elicitation process. Then, they established the 
adequacy values of each technique for each attribute 
value. 

• Tiwari, Rathore and Gupta [18] developed a 

framework based on project’s contextual 
information to select elicitation techniques for a 
given project. One of the limitations of this 
approach is that the mapping function used by the 
approach is theoretical one, 

• Masooma, Asger and Bokhari [19] presented a 
systematic approach for selecting the appropriate 
elicitation techniques based on various factors such 
as system type, requirements type, time, budget, 
stakeholder involved, technique maturity, available 
expertise, etc. 

• Anwar and Razali [20] provided a practical guide 
for selecting the right RE techniques for a given 
project. This guide consists of a set of factors 
indentified from a field study including stakeholder 
characteristics, technique features, project 
environment, etc. 

• Muqeem and Rizwan [21] proposed a framework 
that helps elicitor to select elicitation methods. The 
framework components consist of the following: 
Pre-Domain Development, Stakeholders 
Management, Technique Selection, and 
Prioritization. 

• Jiang, Eberlein and Far [22] proposed a knowledge- 
based approach that helps in RE techniques 
selection. This approach integrates advantages of 
knowledge representation schemata and reasoning 
mechanisms. 

• Kheirkhah and Deraman [23] identified important 
factors in RE technique selection from different 
viewpoints including technique attributes, project 
and organizational and classified them based on RE 
tasks. 

• Hickey and Davis [24] proposed a model for the 
elicitation technique selection and identified a set of 
factors that should be considered during technique 
selection. 


One of the problems related to all presented approaches 
or models is that they have human involvement during 
technique selection. This human involvement may bias the 
technique selection process. Another problem is that they 
consider few attributes that can influence the technique 
selection and therefore they are not greatly useful. Unlike 
these researches, our work tries to: 

• Identify the most critical attributes that influence the 
process of elicitation techniques selection. 

• Automate the process of elicitation techniques 
selection by developing a neural network based 
model. 

• Reduce the human involvement in the process of 
elicitation techniques selection. 

IV. The Proposed approach 

This section describes the proposed approach that helps 
in the selection of appropriate elicitation techniques for a 
specific project. The step by step stages of the proposed 
approach is shown in Fig. 3. The steps of the proposed 
approach are described below: 

• Step 1: Review the related literature reporting 
contextual attributes that may affect requirements 
elicitation techniques selection. 

• Step 2: Analyze the identified attributes using a 
number of criteria to define a preliminary list of 
influential attributes. 

• Step 3: Conduct a web survey with Likert- type 

scale questionnaires to measure the importance of 
candidate attributes in elicitation techniques 

selection process. 

• Step 4: Analyze the collected data using a multiple 

regression model to find out which attributes can 
positively impact the elicitation techniques 

selection. 

• Step 5: Propose an ANN model that is based on the 
selected attributes for selecting adequate elicitation 
techniques. 

• Step 6: Train the ANN based on training and test 
data sets 

• Step 7: Evaluate the ANN model accuracy and 
performance. 

The proposed approach consists of three main 
components. These components are discussed briefly in the 
following subsections. 
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V 



Figure 3. Proposed Approach 

A. Identifying Contextual Attributes and Elicitation 
Techniques Through Literature Review 

As specified in the proposed approach and illustrated in 
Fig. 3, the first step is to determine a set of attributes 
influencing techniques selection and common elicitation 
techniques. As shown in Fig. 4, this step includes the 
following procedures: 


1. Review the related studies directly defining 
attributes that may affect elicitation techniques 
selection process and/or proposing elicitation 
techniques. 

2. Categorize the identified attributes into factors 
describing the contextual elements that influence 
elicitation process. 

3. Analyze each candidate attribute by a number of 
criteria to decide whether it should be included or 
removed. 

4 . After analyzing the attributes, an action can be 
taken with each attribute in order to define the 
preliminary set of attributes that may influence 
techniques selection. 



Figure 4. The Procedures of Literature Review 

1) Identifying Influential Attributes: To identify the 
influential attributes, two types of studies were examined: 
framework proposals and empirical studies. The framework 
proposals define attributes or factors that were proposed by 
the authors to have an impact on the techniques selection 
process. The empirical studies involve experiments that 
show how a variation in some attributes changes the 
effectiveness of some techniques. From these sources, a 
preliminary set of 25 possible influential attributes was 
identified. These attributes can be classified into four 
categories: 

• Elicitor : Requirements engineer or analyst who is 

responsible for eliciting information related to 
requirements form stakeholders. 
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After analyzing and assessing all the candidate attributes 
according to these criteria, an action can be taken with each 
attribute. Table 1 shows a summary of this analysis. The 
actions that can be taken with respect to an attribute are as 
follows: 

• Include ( C): Include the attribute in the preliminary 
list of influential attributes 

• Remove (R): Remove the attribute from the 
preliminary list of influential attributes. 

• Merge (M): Merge with another attribute because 
they are identical. 

• Modify ( F): Change the attribute name. 

As a result of this analysis, 17 out of 25 candidate 
attributes were selected to be included in the preliminary set 
of influential attributes, 4 of 17 selected attributes were 
renamed, and 4 of 25 were merged. Additionally, one 
attribute was added based on the researchers' practical and 
theoretical experience. These totaled 16 attributes: 3 related 
to the elicitor factor, 6 related to the stakeholder factor, 6 
related to the project characteristics and two related to the 
elicitation process factor. 

Table 2 shows the preliminary set of selected attributes 
for the elicitation process along with their descriptions. Each 
attribute was expressed in terms of low, medium and high 
levels in order to determine its role in the techniques 
selection process. Furthermore, these levels were expressed 
numerically on a 0-10 scale to use them effectively in 
techniques selection. 


TABLE 1 THE SUMMARY OF ATTRIBUTES ANALYSIS 


Factors 

Attributes 

Proposing Authors 

Unambiguity 

Measurability 

Practicability 

Action 

Elicitor 

Requirements engineering experience 

[25, 7, 26, 27] 

Y 

H 

H 

C 

Technical knowledge of elicitation 
methods 

[7, 26] 

Y 

H 

H 

M 

Knowledge of (familiarity with) domain 

[ 27, 28, 29] 

Y 

H 

H 

F 

Experience with elicitation techniques 

[7, 26] 

Y 

H 

H 

C 

Cognitive problems 

[7, 26] 

N 

M 

L 

R 

Stakeholder 

Number of users 

[7, 26] 

Y 

H 

H 

F 

Number of experts 

[7, 26] 

Y 

H 

H 

M 

User involvement 

[25,30,31,32, 33] 

Y 

H 

H 

F 

Location/accessibility 

[7, 26] 

N 

L 

L 

R 

Availability of time 

[7, 26] 

Y 

H 

H 

C 

Expertise 

[7, 26] 

Y 

H 

H 

C 

Articulability 

[7, 26] 

Y 

H 

H 

C 

Personality variables 

[7, 26, 34, 35] 

N 

L 

L 

R 

Cognitive problems 

[p2, rlO] 

Y 

M 

M 

F 

knowledge and skills 

[31,33] 

Y 

H 

M 

M 

Communication Skills 

[31,34, 35] 

Y 

H 

M 

M 

Project 

characteristics 

Project complexity 

[36, 37, 38] 

Y 

M 

H 

C 

Requirements size 

[36, 37, 38] 

Y 

M 

M 

C 

Cost available 

[36, 37] 

Y 

H 

H 

C 

Time constraints 

[37, 39, 38] 

Y 

H 

H 

C 

Requirements volatility 

[37, 39, 38] 

Y 

M 

M 

C 

Clarity of project scope 


Y 

M 

H 

+ 

Elicitation 

process 

Purpose of requirements 

[7, 26] 

N 

L 

L 

R 

Process time 

[7, 26] 

Y 

H 

M 

C 

Development methodology 

[7, 26] 

Y 

M 

H 

F 


• Stakeholder. Stakeholders are persons who have a 
stake in the project. Stakeholders can be customers, 
users, mangers, etc. 

• Project characteristics’. Each project has some 
attributes that distinguish one from anther based on 
goal of domain project. 

• Elicitation process : It is the process of gathering 
requirements from stakeholders using different 
techniques. 

Inclusion of an attribute in the preliminary list of 
attributes influencing techniques selection process is based 
on a number of criteria. Each candidate attribute was 
analyzed according to these criteria to decide whether it 
should be selected to be part of the preliminary list. These 
criteria are: 

• Unambiguity : Whether the attribute is well 
explained and defined. The possible values include: 
yes (Y) and no (N). 

• Measurability: Extent to which ratings can be 
defined for the different values of the attribute. The 
possible values include: low (L), medium (M) and 
high (H). 

• Practicability: Possibility of assigning a value to the 
attribute at any time for a real project case. The 
possible values include: low (L), medium (M) and 
high (H). 
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TABLE 2 PRELIMINARY LIST OF THE SELECTED ATTRIBUTES 


Factors 

Attributes 

Description 

Elicitor 

Requirements engineering experience 

Number of previous projects in which the elicitor has performed RE activities. 

Understanding domain knowledge 

Number of previous projects in the domain executed by the elicitor. 

Experience with elicitation techniques 

Previous experience or training acquired by the elicitor with each elicitation 
technique. 

Stakeholder 

Number of individuals in the process 

Number of users that can participate in the elicitation process. 

Stakeholder interest 

Stakeholder’s motivation to participate in the elicitation process. 

Availability of time 

Time that the stakeholder has to spend on the elicitation process. 

Expertise 

Stakeholder’s experience in the problem or work domain. 

Articulability 

Stakeholder’s skill at analyzing and explaining his or her knowledge. 

Stakeholders conflicts 

Level of agreement among stakeholders. 

Project 

characteristics 

Project complexity 

Complex of project based on its structure, requirements needed and functions. 

Requirements size 

Number of requirements of the project. 

Cost available 

Budget constraints on the project. 

Time constraints 

Time that is available for eliciting requirements. 

Requirements volatility 

Change in requirements during software project developments. 

Clarity of project scope 

Clarity of project goals and scope. 

Elicitation Process 

Process time 

Stage at which the elicitation process is prior to the session. 

Development methodology used 

Methodology used to develop the system. 


2) Selecting Elicitation Techniques: A variety of 

elicitation techniques has been presented in the literature to 
address different requirements problems. These techniques 
can be classified into four categories with respect to means 
of communication: classic, cognitive, group elicitation or 
contextual. In this paper, the focus is on the techniques 
which are commonly used most cited. At least two 
techniques were selected from each of the above categories. 
Table 3 shows the techniques that were selected for this 
research. 

B. Multiple Linear Regression Analysis 

This section presents the multiple linear regression model 
which can be used to identify the most critical contextual 
attributes that affect the process of elicitation techniques 
selection. The model was implemented using Microsoft 
Excel. This section consists of four sub sections which are 
described below. 

1 ) Data Collection: To gather the data a web survey 
method was employed. The targeted population was 
professionals working in different software houses. As 
Likert scale is the most commonly used scale in quantitative 
research, a web survey with Likert-type scale questionnaires 
was distributed to the targeted population. The survey 
consists of four sections. The first section includes both of 
the respondent’s details as well as the software project 
information. The second section includes candidate 
attributes that influence techniques selection. To measure 
the importance of candidate attributes, a 10-point Likert 
scale was used. The third section includes degree of 
influence on techniques selection process. To measure this 
degree, a 10-point Likert scale was also used. The last 
section allows respondents to give any additional comments. 
After a five- week survey period, a total of 300 people 
responded by accessing the online survey. 


TABLE 3 Common elicit aion techniques 


No 

Technique 

Literature 

Support 

Type 

Description 

1 

Interviews 

[5, 40, 41, 
42, 43] 

Classic 

It helps get the holistic view 
of the entire system. 

2 

Task Analysis 

[5, 41] 

Classic 

It is used to mange tasks 
between user and system. 

3 

Card Sorting 
/Laddering 

[5, 40, 41, 
43, 44] 

Cognitive 

They are used to prioritize 
stakeholders’ needs. 

4 

Questionnaires 

[40, 41, 43] 

Classic 

They help get large data from 
large number of stakeholders 
in lesser time and with low 
cost. 

5 

Protocol 

Analysis 

[5, 41] 

Group 

Elicitation 

It helps in understanding the 
processes of the system being 
developed. 

6 

Repertory Grid 

[40, 41] 

Cognitive 

It helps identify various 
characteristics among 

different domain units. 

7 

Brainstorming 

[5, 40, 41, 
43] 

Group 

Elicitation 

It helps generate innovative 
ideas. 

8 

Observation 

[40, 41, 42] 

contextual 

It is used to conduct an 
assessment of users’ work 
environment. 

9 

Prototyping 

[5, 40, 41, 
42] 

Group 

Elicitation 

It is used to get early 
feedback from stakeholders. 

10 

Focus Group 

[5,41,45] 

Group 

Elicitation 

It is very effective to handle 
conflicts between different 
stakeholders. 

11 

JAD 

[5, 40, 41, 
46] 

Group 

Elicitation 

JAD helps in making rapid 
decision and mostly used in 
business analysis. 

12 

Surveys 

[40, 41, 47, 
48] 

Classic 

They are used to collect data 
from large number of 
population. 

13 

Workshop 

[5,41,48] 

Group 

Elicitation 

It is a small meeting between 
stakeholders for capturing 
large and complex 

requirements. 

14 

Ethnography 

[40, 41, 43, 
49] 

Contextual 

It is useful in capturing 
contextual factors such as 
usability. 
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2) Building a Multiple Linear Regression (MLR) Model: 
As this study is an exploratory study to find out which 
attributes can positively impact the elicitation techniques 
selection process, it is appropriate for a multiple regression 
analysis, where the relationship between multiple 
independent variables (attributes influencing technique 
selection) and the dependent variable (degree of influence 
on techniques selection process) is determined. Based on the 
Eq. (2), the multiple linear regressions model can be 
expressed as follows: 

y C DITSP ) = p 0 +p 1 CT 1 + p 2 CT 2 + - + p l2 CT 12 (3) 
Where: 

y (DITSP): is Degree of Influence in Techniques 
Selection Process (dependent variables) 

P 0 : is the y-intercept 

Pi : is the regression coefficient 

CT t : is the Candidate Attribute 

3) Data Analysis and Results: In this basic analysis, there 
were only three main tables presented to provide useful 
information about the model and the contribution of each 
explanatory variable. The first table of interest is the Model 
Summary table (Fig. 5). This table provides the multiple 
correlation coefficient (R) which is 0.997, coefficient of 
determination (R 2 ) which is 0.994, and finally adjusted R 
square (R 2 ) which is equal to 0.819. This indicates that 
81.9% of the variance in the dependent variable (DITSP) 
can be explained by the independent variables (selected 
attributes), while the rest (18.1%) is explained by other 
causes. In other words, the elicitation technique selection 
process is strongly related to the selected attributes of 
elicitor, stakeholders, project, and elicitation process. 


Regression Statistics 

Multiple R 

0.997136739 

R Square 

0.994281675 

Adjusted R Square | 
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Standard Error 
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Figure 5: Regression Statistics 
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Figure 6: NOVA Analysis 


ANOYA table (Fig. 6) shows an analysis of variance that 
provides information about levels of variability within a 
regression model and its explanatory power. The F value and 
the associated significance value in the ANOYA table 
indicate the statistical significance of the multiple regression 
model. For this model, the p value for F statistic is lower than 
0.05 (p<0.05). This means that the multiple regression model 
is generally acceptable and statistical significant to determine 


the most important attributes that influence elicitation 
techniques selection. 



Coefficients 

Standard Error 

tStat 

P-value 

Intercept 

8.069827055 

6.260233593 

1.289061652 

0.044840862 

CT1 

1.516518091 

2.719074528 

0.596303942 

0.027663295 

CT2 

1.674002412 

2.780752167 

0.35744722 

0.00 1299 398 

CT3 

0.298426484 

2.300301053 

0.672947187 

0.526036343 

CT4 

1.200576822 

2.431115998 

0.241808635 

0.031698092 

CT5 

0.587864841 

2.404734989 

0.630638344 

0.551525486 

CT6 

1.057823388 

2.213529539 

0.284160846 

0.035179945 

CT7 

1.547981122 

4.568137764 

0.38689 7291 

0.009254345 

CTS 

0.879654321 

0.443275946 

2.70841861 

0.712182345 

CT9 

1.547654321 

2.098765438 

0.192836247 

0.011667453 

CT10 

0.993972131 

2.654376827 

0.76543277 

0.039 3 79 3 21 

CT11 

0.386776543 

2.074885864 

0.368344112 

0.512 555858 

CT12 

1.767400125 

2.13295 2342 

0.816046507 

0.000294526 

CT13 

1.640012412 

4.613762442 

0.879058113 

0.004024345 

CT14 

0.628998427 

0.4375 6 2711 

2.084109973 

0.18234255 

CT15 

1.621394861 

4.683772144 

0.868708581 

0.007254499 

CT16 

1.151091243 

0.435656567 

65535 

0.03351 

CT17 

0.40928414 

1.450278143 

0.282210789 

0.91268 


Figure 7: coefficients Analysis 


The Coefficients table (Fig. 7) gives the regression 
coefficients (b) that indicate the individual contribution of 
each independent variable to the model. The p- value in the 
Coefficients table indicates which variables are significant. If 
p<0.05, the coefficient is statistically significant. Otherwise, 
the result is opposite. For instance, the p- value for CTn 
(p=0.91) is greater than 0.05, therefore, this attribute should 
be rejected. Thus, those variables with significance level 
p<0.05 and top coefficient values would be recognized as 
candidates for being important attributes influencing 
techniques selection. After the screening of p-values, the 
most critical attributes which affect elicitation techniques 
selection were determined as presented in Table 4. 


TABLE 4 SUMMARY OF CRITICAL ATTRIBUTEES INFLUENCING TECHNIQUES 
SLECTION PROCESS 


No. 

Attribute 

ID 

Attribute 

Name 

Coefficients 

Factor 

1 

CT12 

Cost available 

1.767400 

Project 

2 

CT2 

Understanding domain 
knowledge 

1.674002 

Elicitor 

3 

CT13 

Time constraints 

1.640012 

Project 

4 

CT15 

Clarity of project scope 

1.621395 

Project 

5 

CT7 

Expertise 

1.547981 

Stakeholder 

6 

CT9 

Stakeholders conflicts 

1.547654 

Stakeholder 

7 

CT1 

Requirements 
engineering experience 

1.516518 

Elicitor 

8 

CT4 

Number of individuals 
in the process 

1.200577 

Stakeholder 

9 

CT16 

Process time 

1.151091 

Elicitation 

Process 

10 

CT6 

Availability of time 

1.057823 

Stakeholder 

11 

CT10 

Project complexity 

0.993972 

Project 
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4) The Proposed Algorithm of the MLR Analysis: This 
section presents an algorithm for identifying the critical 
attributes that influence the process of elicitation techniques 
selection by a multiple regression analysis. Fig. 8 shows the 
flow chart of the proposed algorithm. The steps of the 
proposed algorithm are as follows: 

1. Start 

2. Determine the dependent variable (degree of 
influence on techniques selection process) and the 
independent variables (attributes influencing 
techniques selection). 

3. Build the multiple linear regression model based on 
the general equation of MLR. 

4 . Estimate the MLR model. 

5 . Check the value of the adjusted R square (R2-) to 
measure how much of the variability in the 
dependent variable that is accounted for by the 
independent variables 

6. Test whether the regression model is statistically 
significant to determine the critical attributes 
influencing techniques selection by an analysis of 
variance (ANOYA). 

7. If significance F > 0.05 

{ 

- Update the explanatory variables 

- Estimate the model 

} 

Else 

{ 

- Accept the model 

} 

8. Check the P-values for each variable to identify 
which attributes are significant. 

9. If P- Value > 0.05 


- Reject the attribute 


Else 

{ 


Start 


Specify dependent 
variable and 
independent variables 



N 

\ 

0 

! 

Accept the model 

\ 

/ 

Check the P-values 
for each variable 

\ 

/ 


Update the 
explanatory 
variables 



Include the attribute 
in the critical list 


Rejei 

attri 

;t the 
bute 


End 


Figure 8: Flow Chart of the Proposed Algorithm for MLR Model 


- Include the attribute in the critical list of 
influential attributes 

} 

10 . End 


C. Artificial Neural Network Based Model for Elicitation 

Techniques Selection 

This section presents a neural network based model for 
selection of requirements elicitation techniques. The ANN 
model was implemented using Neural Network Fitting Tool 
in MATLAB. The data collection, ANN architecture, training 
the network, evaluating its performance and proposed 
algorithm of ANN model are described in the following sub 
sections. 
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1) Data Collection and Preprocessing: Data Collection 
and preparation is the first step in designing ANN model. 
The research’s targets were software engineers in different 
software development companies. For this study, the data 
was gathered by a survey sent out via e-mail from February 
to May 2016. About 300 surveys were sent to the 
companies. The returned questionnaires were 160, which 
indicated a response rate of 53.33%. After data collection, 
the data preprocessing was conducted to train the ANN. 

All the data was normalized using min-max 
normalization to speed up the training phase. Min-max 
normalization performs a linear transformation on the 
original data values. Suppose that min T and max T are the 
minimum and maximum values of an attribute T. It maps 
value v of T to v’ in the range [0.0- 1.0] using the following 
formula [50]: 


max? -min? v J 

2) ANN Modeling: In this paper, a feed-forward back 
propagation network is proposed to predict suitable 
elicitation techniques for a given project. It consists of three 
layers: the input layer, one or more hidden layers, and the 
output layer. The inputs to the neural network are influential 
attributes presented in Table 4. The output is a combination 
of appropriate elicitation techniques. The properties of ANN 
proposed in this paper are presented in Table 5. 


The training method of ANN is Levenberg-Marquardt 
algorithm. In this algorithm, the different weights of neurons 
are adjusted so as to minimize the error between network 
outputs and target outputs. Trainlm function was selected as 
a training function to train the network as shown is Fig. 10. 
The selected transfer function was tansig. The adaptation 
learning function is learngdm and performance function as 
MSE. Training continues as long as network continues 
improving on the validation set. 



TABLE 5 NEURAL NTEWORK MODEL PROPERTIES 


Architecture 

Hidden Layer 

1 

Hidden Neurons 

20 

Input Neurons 

11 

Output Neurons 

14 

Training 

Training Algorithm 

Levenberg-Marquardt 

Training Function 

TRAINLM 

Transfer Function 

TANSIG 


3) Network Training: The neural network was trained by 
160 data records. For this research, 100 out of 160 records 
were considered for the training set, 30 for the validation set 
and the remaining 30 for the test set. The network was tested 
by investigating different numbers of neurons (i.e., 12, 20, 
25, 28 and 32) in hidden layer(s) in order to select the best 
structure of the network. The best results were obtained 
from the network with 1 1 inputs, 20 neurons in the hidden 
layer and 14 outputs as shown in Fig. 9. 


Hidden Layer Output Layer 



4) Performance Evaluation: In order to assess the model 
accuracy, it is necessary to use some quantitative measures 
of learning. In this study, the Mean Squared Error (MSE) 
and regression analysis were used to evaluate the model 
performance. MSE is a useful measure of success for 
numeric prediction and is calculated using Eq. (5). It is 
worth mentioning that small values of MSE indicate better 
performance of the ANN model. It was found that the 
optimum performance of the model is at 25 neurons with 
MSE 0.1481. The accuracy of the model with different 
neurons is presented in Table 6. 

MSE = (5) 

NP W 

Where P is the number of output possessing elements, N 
is the number of observations, t tj are the target outputs and y# 
are the actual outputs. 


TABLE 6 ACCURACY FOR THE NETWORK 


Neurons in the 
Hidden Layers 

MSE Values 

12 

0.3263 

20 

0.1652 

25 

0.2672 

28 

0.2870 

32 

0.6301 
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Fig. 11 shows the regression analysis of targets and 
outputs for Levenberg-Marquardt algorithm during training 
and testing process. The best fit lines in Fig. 11 demonstrate 
the relationship between the desired value and actual value. 
As shown in Fig. 11 the R value of ANN model is 0.8134 
which indicates that LM algorithm has high ability to train 
data for the model. Overall, the network has an accuracy of 
81%. 


Test: R--0.19181 All: R=0.81343 




Figure 1 1 : Neural Network Regression Analysis 


5) The Proposed Algorithm of Neural Network Based 
Model for Elicitation Technique Selection: This section 
presents an algorithm of neural network based model for 
selection of requirements elicitation techniques. Fig. 12 
shows the flow chart of the proposed algorithm. The steps of 
the proposed algorithm are as follows: 

1. Start 

2. Normalize the input data using min-max 
normalization in the range [0.0- 1.0]. 

3. Divide the data set into training, test, and validate. 

4 . Create the neural network initially with N inputs, H 
neurons in the hidden layer(s) and O outputs. 

5. Train the neural network using Levenberg- 
Marquardt algorithm. 

6. If MSE is small 

{ 

- Save the neural network 

} 

Else 

{ 

- Update the weights and bias between 
input and hidden layers. 

- Modify the number of neurons in the 
hidden layers. 

} 

7 . Test and validate the network to check out the 
model accuracy. 

8. Use the trained and verified network for predicting 
suitable combination of elicitation techniques for a 
given project. 



Figure 12: Flow Chart for the Proposed Algorithm of ANN Model 
V. CASE STUDY 

In order to empirically validate our ANN model a case 
study in a software company X was conducted. One project 
was selected for the case study. One of the problems of the 
selected project is that the techniques selection procedures 
are based on elicitor experience. The aim of this case study is 
to check whether the proposed ANN model predicts more 
effective techniques than the current requirements 
engineering methods. The contextual situation of the 
selected project that is the particular values for the influential 
attributes is shown in Table 7. Table 8 shows the normalized 
values calculated using Eq. (4) for each attribute. These 
normalized values are considered the input data for the ANN 
model. 
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TABLE 7 EXAMPLE OF PROJECT INPUT DATA 


Project 

Description 

The goal of the project was to develop an Android mobile 
application that enables users to scan their physical 
documents using their smart devices camera, then crop, 
enhance, sync, share, store and manage these documents 
as needed. 

Selected 

Attributes 

Values 

Cost available (TV): Low (3) 

Understanding domain knowledge( T 2 ): Low (2) 

Time constraints (TV): Medium (5) 

Clarity of project scope (TV): Low (3) 

Expertise ( T 5 ): Medium (5) 

Stakeholders conflicts (TV): High (8) 

Requirements engineering experience (TV): Low (3) 

Number of individuals in the process (TV): Medium (5) 

Process time (TV): Low (1) 

Availability of time ( T 10 ): Low (2) 

Project complexity ( T n ): Medium (6) 


TABLE 8 NORMALIZED INPUT DATA 


Attribute 

Normalized Value 

TV 

0.3 

t 2 

0.2 

t 3 

0.5 

t 4 

0.3 

t 5 

0.5 

TV 

0.8 

TV 

0.3 

TV 

0.5 

TV 

0.1 

T 10 

0.2 

T n 

0.6 


R=[l 1 1 101000100011 

The result of ANN model is shown in matrix R. The 
result is analyzed as ‘1’ in the output vector corresponds to 
the elicitation technique that is selected for the elicitation 
process while ‘O’ in the output vector corresponds to the 
elicitation technique that is rejected to use in the elicitation 
process. As a result, the ANN model recommended the 
following techniques: interviews, task observation, card 
storing/laddering, questionnaires, repertory gird, focus group 
and ethnography. 

VI. CONCLUSION AND FUTURE WORK 

This paper proposed an approach to help requirements 
engineers in selecting the most suitable elicitation techniques 
for a particular project. To do this, the literature review was 
conducted to identify the attributes which are relevant to the 
context of the elicitation process and influence techniques 
selection. Then, these candidate attributes were analyzed 
using a multiple regression model to find the most important 
attributes that influence techniques selection and eliminate 
the less critical ones. Finally, a neural network based model 
for elicitation techniques selection was developed. In order to 
empirically validate our ANN model a case study was 
conducted. The results showed that the proposed ANN model 
has proven its effectiveness in selecting more effective 
techniques than the current requirements engineering 
methods. It is recommended as a future work to integrate 
other machine learning techniques such as fuzzy logic with 
the proposed ANN model in order to enhance the model 
accuracy. 
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Abstract — Proxy Re-Encryption has been used since the need 
for forwarding an encrypted message to a party for whom it was 
not encrypted was highlighted in the form of delegation rights by 
Blaise, Bleumer and Strauss. Various Proxy Re-Encryption 
schemes have been introduced till today mainly focusing on 
demonstrating features like transitivity and collusion-resistance 
to ensure minimal trust on the proxy and maximum key -privacy. 
This survey highlights some major schemes introduced, classifies 
them based on their directionality, brings to light their major 
advantages and disadvantages, and provides a detailed 
comparative study based on the key features a Proxy Re- 
Encryption Scheme must possess in order for its widespread. 
Index words — bilinear maps, CCA secure, collusion resistance, 
CPA secure, delegation rights, Deffie-Hellman key exchange, 
DBDH assumptions, Proxy Re-Encryption; transitivity. 

I. Introduction 

Considering the direction of development from traditional 
sequential systems towards distributed systems, cloud 
computing where different computational infrastructures are 
available to the users as services (infrastructure as a service, 
platform as a service, software as a service etc), IOT; security 
and privacy of data has become the primary concern of 
organizations and users worldwide because these 

developments require an unavoidable sharing of resources, 
personal and confidential data over the network. Although 
network security schemes have been implemented and provide 
access and authorization controls, need still remains of further 
improvement. Proxy re-encryption is a relatively new data 
encryption technique devised primarily for distributed data 
and file security. The goal of proxy re-encryption is allowing 
the re-encryption of one cipher text to another cipher text 
without relying or trusting the third party that performs the 
transfer. In situations where one user wishes for another user 
to decrypt a message using its own or a new secret key instead 
of the first user’s secret key, one technique involves the 
assistance of a proxy. An easily implemented re-encryption 
scheme is one in which the proxy is given possession of both 
Users’ keys so the message can be converted to plaintext and 
then re-encrypted for the second user but this is comparatively 
weak. Userl’s secret key decrypts the cipher-text to plaintext, 
while User2’s secret key encrypts it. But this is a violation of 
the primary goal of security; the purpose of proxy re- 
encryption schemes is to prevent the revelation of the keys 
involved in re-encryption and the plaintext that needs to be re- 
encrypted to the proxy. In this context the method mentioned 
above is not ideal. So for these scenarios where trust cannot be 


placed in a proxy, the requirement here is to convert messages 
encrypted under Userl’s public key to messages encrypted 
under User2’s public key without the proxy being able to 
decrypt the message. The scheme that ensures this 
arrangement is known as proxy re-encryption. Even though 
Proxy re-encryption schemes are basically a version of 
existing encryption schemes consisting of selection of text, 
generation of keys, sharing or transmitting of keys between 
concerned parties, conversion from plaintext to cipher-text on 
one end and conversion from cipher-text to plaintext on the 
other end, the difference arises with the introduction of two 
more properties. 


User A 


MpMA) 


Fig. 1 . Representation of Proxy Re-Encryption 


Proxy 

Mhc(AB) = 

Re-Encrypt Ik(AB ) (M,*<a>) 


User B 


MrkfAB) 


Directionality 


If the re-encryption scheme is reversible — that is, the same re- 
encryption key is used to translate messages from Userl to 
User2, as well as from User2 to Userl the scheme is classified 
as a bi-directional scheme. In these schemes if a user forwards 
a message to another, it automatically gives rights to the 
receiver to communicate with the sender. Such re-encryption 
keys are hence generated with the keys of both sender and 
receiver and with their mutual trust and consent. 


A unidirectional scheme is one-way in this context; giving a 
higher level of security and making it a feasible option in non 
trusted setups where message conveying is essential but not to 
an extent where receiver should be given rights to respond to 
it. So if a message is re-encrypted from Userl to User2 with a 
key, it cannot be used for re-encryption from User2 to Userl. 
Moreover uni-directional schemes are more useful since they 
can be converted to bidirectional scheme at any time simply 
by running it in both directions, i.e. from Userl to User2 and 
from User2 to Userl [14]. 

Transitivity 
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Transitivity in proxy re-encryption schemes is defined as the 
number of re-encryptions allowed by an algorithm. A 
transitive PRE scheme would allow a cipher text to be re- 
encrypted from Userl to User2, and then again from User2 to 
User3 and so on. While a non-transitive scheme would allow a 
cipher text to be re-encrypted for a single time (or a pre- 
defined limited number). This implies that inn non-transitive 
schemes the proxy does not have the authority to assign 
delegation rights to others beside the pair of communicating 
users. Besides the above mentioned properties, some more of 
the security properties demonstrated by existing proxy re- 
encryption schemes are [3] the inability of the proxy to view 
plaintext irrespective of the scheme. The secret keys are 
generated at the data owner’s end, and the proxy in no way 
can derive the secret keys of the sender or receiver from the 
re-encryption key. The transitivity and delegation level of an 
applied scheme depends on the trust matrix of the involved 
parties, on the fact level of security at each party’s end and the 
priorities of the involved parties (security, confidentiality, 
integrity, etc). The need of PRE schemes was first highlighted 
when Mambo and Okamoto in 1997 mentioned the concept of 


delegating decryption rights to improve efficiency instead of 
the conventional decrypt-and-then-encrypt approaches. [2] 
This work was enhanced by Blaze, Bleumer, and Strauss 
(BBS) in 1998 when they proposed an application called 
atomic proxy re-encryption. In their proposed schemed a 
partially-trusted proxy was allowed to perform conversion 
from a cipher-text for one user into a cipher-text for another 
user but was not allowed to access the underlying plaintext. 
[ 1 ] 

Although efficiently computable, flexible and applicable the 
adoption of BBS re-encryption over a larger application 
domain for managing encrypted file systems has been 
hindered by considerable security risks. [4] These methods are 
still under process of maturity and require fine tuning before 
being adopted in every organization. 

II. CLASSIFICATION AND ANALYSIS OF PROXY RE- 
ENCRYPTION SCHEMES 



Fig. 2. Classification of Proxy Re-Encryption Schemes Based on Directionality 
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A. Type and Identity Based Proxy Re -encryption 
Scheme 

This scheme has thrown light on the problem of 
multiple delegations of decryption rights. Suppose 
the delegator wants two different users to view 
different sub parts of his message. The solution 
would be to place trust in the proxy to re-encrypt the 
selective parts of the cipher-texts using this method. 
This fails if the proxy is corrupted. A better but 
unrealistic alternative is choosing a separate pair of 
keys for each delegate. The type-and-identity-based 
proxy re-encryption scheme is based on the Boneh- 
Franklin Identity Based Encryption scheme [19] 
enabling implementation of different access control 
policies for cipher-texts against multiple receivers. 
The messages are categorized into different types 
according to the decryption rights of the intended 
receivers. The main benefit of this scheme is the 
single pair of keys which provides re-encryption 
capability to the proxy for his cipher-texts against his 
receivers. But the proposed scheme works only for 
the cipher-texts generated by the sender. 

The method is described as follows: 

Users categorize their messages into different types 
Setup and Encrypt are the same as in the Boneh- 
Franklin scheme 

Re-Encrypt(msg,type,msg_id) : the algorithm outputs 
the cipher-text ‘submsg’ = (msgl,msg2,msg3) based 
on the message and the type given by user. Each sub 
message is meant to be decrypted by the respective 
receiver and no one else. 

Decrypt(sub_msg,skid) : Given a cipher-text 

‘submsg’ = (msgl,msg2,msg3), the algorithm 
outputs the message ‘msg’ based on the ‘skid’ of the 
receiver. Hence every receiver gets the sub message 
intended for him and nothing more. [7] Another 
scheme and its construction is discussed in [15] based 
entirely on type of the user is also discussed and its 
various versions are implemented. 

B. Conditional Proxy Re-encryption Scheme 

In situation where fine-grained delegation is required 
requiring fulfillment of a predetermined condition, 
the notion of conditional proxy re -encryption (or C- 
PRE) was introduced, whereby only cipher-text 
satisfying one condition set by Sender is allowed to 
be transformed and then decrypted by receiver. The 
scheme is proven to be CCA-secure. The scheme is 
now improved to work based on multiple conditions 
rather than one as was its initial version. The 
conditions can be anything specified by the involved 
parties and the construction of the algorithm. They 
can be a set of pre-defined integers, the sending or 
receiving conditions of the parties, the physical 
location of the sender or the receiver. The message to 
be sent is encrypted using the receiver’s public key 


and the condition. Similarly to decrypt the message 
the receiver should meet the pre-defined conditions. 
The challenge now remains to construct CCA-secure 
C-PRE schemes with anonymous conditions rather 
than known predefined conditions. [10] 

C. Attribute Based Proxy Encryption Scheme 

The Attribute based proxy re-encryption schemes 
provide a better option especially when 
impersonating a user is an active issue. Moreover the 
problem of authentication of a user is easily solved 
by this. Attribute based PRE involves various user 
attributes like city, country, street number, GPS 
coordinates, or any other set of attributes that are 
predefined while encryption. When a user possesses 
these attributes only then is the decryption of a 
message possible and allowed. The identification of 
these attributes is based on a certain threshold i.e. if 
the attributes of the receiver match the required 
attribute set by a certain degree or level, the 
decryption access is granted and the message can be 
decrypted by only using these attributes and the 
secret key. So even if a single attribute doesn’t meet 
the threshold the whole decryption fails. This is a 
general scheme whose various modifications exist, 
namely Cipher-Text policy attribute based encryption 
and Key policy attribute based encryption which are 
widely implemented. This mechanism is joined with 
the proxy re-encryption and implemented in various 
categories. 

D. Key Private Proxy Re -encryption Scheme 

Key Private Proxy Re-Encryption also known as 
Anonymous Proxy Re-Encryption introduces the 
notion of keeping the keys private such that even the 
proxy that performs the transformation of message 
cannot identify or differentiate between the involved 
users. None of the early PRE schemes provided key 
security. This scheme is CPA-secure but work is still 
in progress regarding CCA- safe key private PRE 
schemes. If a proxy communicates with multiple 
users it should not be able to reveal to a user what 
other parties are communicating with it from the 
message being transmitted or the set of re -encryption 
keys available. This information should not lead to 
the users. The necessity and benefit of a key private 
scheme is that nobody can detect who has access to a 
certain message i.e. complete anonymity of the users 
involved in a communication. [9] 

E. Ciphertext-Policy Attribute based Proxy Re- 
encryption: 

Ciphertext-Policy ABPRE is a joint construction of 
attribute-based encryption and traditional proxy re- 
encryption scheme. It is proven to be secure against 
CPA. It is a type of ABE where the key is associated 
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with an access structure namely a group of attributes 
defining the type of user that should be given access 
and decryption rights. This solves the issue of 
multiple users and key distribution over a large 
audience. Key management creates an overhead in 
such situations and this algorithm is beneficial in this 
context. Recent variations of this algorithm are 
proven secure against chosen ciphertext attacks under 
decisional q-parallel BDH assumption [11]. This 
algorithm has widespread applications in medical 
domains where patient records are continuously 
being transferred and referred from one doctor or 
facility to another. It provides a fine grained access 
control to the user over the delegates enabling it to 
specify who can decipher the data or message by 
setting with it a set of attributes [13]. CP - ABPRE 
scheme is a collusion resistant uni -directional scheme 
and is associated with a monotonic access structure. 
A CCA secure version of CP-ABPRE is also 
constructed in [16]. 

F. Time/Clock Based Proxy Re -encryption Scheme 
A cloud environment is composed of several 
independent servers communicating to provide 
services. In a time based re-encryption scheme, each 
cloud server is allowed to independently re-encrypt 
data automatically in contrast to the previous 
methods where the data was encrypted only after 
receiving a command from the sender [18]. This 
allows an automatic re-encryption of data based on 
the internal time of the cloud servers rather than by 
manual commands. The data is associated with a 
control structure for defining access and a time for 
which the access is granted [18]. Hence every piece 
of data stored in the cloud is associated with a set of 
attributes that define the type of user the data is 
meant for and a time structure which basically 
specifies the time limit for which the data will be 
accessible to the user. The receiver is issued keys that 
become effective during the specified access times, 
implying that the receiver can decrypt the message 
using only those keys which match the access time. 
The data owner and the Cloud Service Provider share 
the secret key. This key is later used to create sub- 
keys for the users and when re-encrypting the data 
along with the clock time of the system. This 
combination of access structure facilitates user 
revocation and distribution of delegation rights. The 
algorithm is based on the Bilinear Deffie -Heilman 
assumption like most proxy re-encryption schemes. 
The algorithm operates in the following mechanism. 
First the algorithm is setup by generating the master 
key, public key and defining a universal attribute set 
from which the individual attributes will be late 
selected. Then the CSP identifies all its users and 
generates secret keys for them based on their attribute 


sets. The data is then encrypted based on the above 
mentioned access structure. Now when a user 
requests for a certain data, it is re-encrypted with the 
internal time of the system, hence setting up a valid 
access time for decryption by the user. 

Therefore a user satisfying the access structure i.e. 
the attribute set can successfully attempt decryption if 
the time hasn’t expired [6]. 

G. Threshold Proxy Re-encryption Scheme 
There are three problems in a decentralized cloud 
storage system. First, high level of traffic between the 
user and storage servers leads to more computation 
by the user. Second, key management becomes a 
problem for the user because security is broken if the 
user’s keys are compromise. Thirdly, directly 
forwarding a user’s messages to another one is not 
feasible. 

The proposed system is constructed around the 
proposed scheme named Threshold Proxy Re- 
Encryption. In the beginning the cloud storage 
system stores user details in some database. The user 
needs to get registered in the database, by entering 
his data like user_name, user_gender, user_location, 
user_password, user_birthdate, and user_e-mail 
address. The user then logs into the system using his 
credentials that were initially registered. The file is 
forwarded contained in a folder along with the user 
and recipients name, a security question for 
decryption access, the file containing the key for 
decryption and the status of the message. The file is 
transferred using the receiver’s email and public key. 
After the file is received by the receiver, the selected 
file is downloaded. But before downloading the file, 
he has to download the key file that was sent in the 
same folder. In order to download the key file, 
receiver has to enter the following details like file 
name, the secure question and its answer. Now the 
key is revealed to the receiver with which the 
message can be downloaded and decrypted. [8] 

ANALYSIS 

Type Based PRE provides semantic security and 
cipher-text privacy control but on the other hand 
encoding operations over encrypted messages is not 
possible limiting its widespread use. 

Key-Private PRE provides security against Chosen 
cipher-text Attack but the privacy proof of this 
scheme is more difficult than Chosen plaintext attack. 
Identity -based PRE is secure against an adaptive 
CCA but it is difficult to find such constructions for 
the algorithm that are multi-use, efficient and CCA 
secured. 

Ciphertext Policy Attribute-Based PRE provides a 
fine grained access control over data by limiting the 
decryption writes based on various attributes of the 
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receiver but it has an average efficiency and 
flexibility compared to the other schemes. 

Conditional PRE schemes provide a very efficient 
mechanism against CCA but it is very difficult to 
design C-PRE schemes that are CCA secure. 

Time based PRE is a more recent modification of 
PRE schemes which provides a scalable user 
revocation and reduces the workload of data owners. 
The major disadvantage of this scheme is that it 
require s the effective time period to be same for all 
attributes associated with the user. 

Threshold PRE enables data forwarding efficiently 
but it requires very high access control which 
becomes difficult to provide. 

III. Applications Of PRE 

Proxy re-encryption has many exciting applications 
in addition to the previous proposals [Blaze et al. 
1998; Dodis and Ivan 2003[5]; Jakobsson 1999; Zhou 
et al. 2004] for performing cryptographic operations 
on storage-limited devices, law enforcement and 
most commonly in email forwarding. In particular, 
proxy cryptography has a natural application to 
secure network file storage: 

Secure File Systems: A secure file system is the most 
obvious application of proxy re-encryption because 
we always assume that a storage system will be non- 
trusted and in PRE the goal is to use a non-trusted or 


partially trusted party for re-encryption but avoid any 
harm to data. 

Outsourced Filtering of Encrypted Spam: The 
filtering of encrypted emails performed by 
freelancing contractors which is a requirement due to 
spamming and hoaxing performed by hackers and 
trouble makers is an application of proxy re- 
encryption that is equally applicable but less known. 
The amount of such emails has overwhelmed the 
filtering capacity of many small businesses. This has 
lead to a potential market for email filtering 
outsourcing. The advancement in techniques used by 
these hackers has rendered basic filtering measure 
useless. With the help of proxy re-encryption, 
incoming encrypted email can be forwarded to an 
external contractor for filtering at the first email 
gateway, without any risk of exposure of the 
underlying plaintexts. [4] This survey discusses in 
detail the various PRE schemes introduced till now 
starting from Mambo and Okamoto [2], their pros 
and cons and applications of each in respective fields. 

IV. COMPARATIVE STUDY 

The following table shows a comparative study of the 
PRE schemes discussed above based on the 
properties of directionality, multi-use, transitivity, 
interactivity, security, key-privacy, collusion 
resistance, and the assumption on which the 
algorithm is built: 


TABLE I 

Comparison of PRE Techniques 


Schemes/ 

Key Features 

Type- 
based PRE 
[15] 

Identity- 
based PRE 
[14] 

Key-private 
PRE [9] 

Conditional 
PRE [10] 

Clock-based 
PRE [6] 

Threshold- 

based 

PRE[8] 

Ciphertext 

Policy- 

ABPRE[11] 

Unidirectional 

/Bidirectional 

Bi 

Uni 

Uni 

Uni 

Uni 

Bi 

Uni 

Multiple-use 

No 

Yes 

No 

No 

No 

- 

Yes 

Transitivity 

No 

No 

No 

No 

No 

No 

No 

Non-Interactive 

Yes 

Yes 

Yes 

Yes 

Yes 

No 

Yes 

Key-private 

Yes 

- 

Yes 

- 

- 

- 

- 

Collusion-resistant 

No 

No 

Yes 

Yes 

- 

Yes 

Yes 

Fine-grained 

delegation 

Yes 

Yes 

- 

Yes 

Yes 

Yes 

Yes 

Ciphertext-private 

Yes 

Yes 

Yes 

Yes 

Yes 

Yes 

Yes 

Key-pairs 

1 

- 

1 

- 

1 

- 

- 

Secure against 

CPA, CCA 

CPA(if multi- 
use), CCA(if 
single use) 

CPA 

CCA 

CPA 


CPA, CCA 

Assumption 

DBDH, Co 
BDH 

DBDH 

DBDH 

3 -quotient 
BDH 

BDH 


Decisional 

q-parallel 

BDH 
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V. Conclusion 

This paper briefly discusses various proxy re- 
encryption schemes, their general mechanism and 
implementation. They are then broadly classified 
based on directionality and a comparison is given 
after analyzing the schemes for traits that should be a 
part of every successful proxy re-encryption 
algorithm. 

Future work on proxy re -encryption should include 
features of key-privacy and transitivity. Since most 
schemes are collusion resistant and key -private but an 
efficient mechanism also providing transitivity is 
missing. 
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Energy Efficient Routing Protocols in Wireless Sensor 

Networks: A survey 

Owais Khan, Fiaz Gul Khan, Babar Nazir, Usman Wazir 


Abstract — WSN is an evolving technology since last ten years. As 
wireless nodes work have less power supply in the form of a battery, 
it is necessary for the nodes to work for maximum time. Different 
techniques are adopted to achieve better energy optimization. This 
paper presents a survey on energy efficient routing techniques, 
which will help in understanding the factors which affect energy 
efficiency and other performance parameters and will help to 
analyse the techniques for further optimizations. 

Index Terms — Wireless Sensor Networks, Energy optimization, 
Topology. 


I. INTRODUCTION 

The great advance and inventions in computer hardware and 
software technology has the industry to create very small 
components, chips and this advancement created small sensor 
nodes which operate on low cost batteries. They have 
computational, sensing and transmitting components which 
capture data from vast environment and send it to a master node 
or sink node. This network is called wireless sensor network. The 
nodes have the ability to communicating with each other and also 
directly with BS. Sensor networks are gaining popularity since 
last decade and a lot of research work is going on in different 
WSN domains i.e. routing, scalability, network life time, data 
integrity and security etc. WSNs are largely adopted by 
industries and organizations like health care, agriculture and 
environment, military operations, safety and security, transport 
systems etc. WSNs are useful because of its low cost deployment 
and energy consumptions. However, energy consumption is 
considered to be the main problem as there are certain constraints 
on energy saving and computation in nodes. 
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In WSNs, nodes are deployed in an environment to gather data. 
The node deployment, number of nodes to be deployed depends 
on the kind of application for which WSN is being used. Nodes 
are either deployed in a specific order and position or they are 
deployed accordingly [2]. 

WSNs adopt different techniques to optimize energy 
consumption. Some of these techniques are radio optimization, 
data reduction, sleep/wake up schemes and routing protocols, 
battery replication [ 1 ] . 

This paper will present a survey on routing protocols. Routing 
can be optimized in so many different manners to optimize 
energy. Lots of work has been done in routing protocols for 
WSN. The main area or task in WSNs which consumes energy 
is the transmission phase. Routing can be manipulated in 
different ways to decrease energy consumption. This paper will 
present a survey on hierarchical routing protocols. Hierarchical 
approach use clustering of nodes which decrease communication 
distance, aggregate data, increase network life, minimize number 
of transmissions, thus decreasing energy consumption [3]. 

Let’s discuss some main challenges in routing protocol of 
WSNs. Node deployment either random or in specific order, data 
collection (time-driven, event driven or query driven), integrity 
of data while consuming minimum energy, network life [1]. 
Hierarchical protocols are further divided into sub categories 
which are grid based, chain based, tree based and area based. 
Different hierarchical techniques, their advantages and 
disadvantages will be studies in section 2. The composition and 
format of overall paper is as follows. 

In section 3, these techniques will be compared and their 
comparative analysis will be done against some parameters. In 
this section 4, the techniques will be analyzed in such a way that 
which technique is most suitable for a certain category of 
application. This section will also present trade-offs between 
different performances parameters. In 5th section, the problems 
addressed in section 2, will be given with some possible 
solutions, the section may also include some future work 
regarding the field. Section 5, will present some related work and 
in section 6, the paper will be concluded. 

The paper will present an analysis of these techniques in section 
1. Some problems in these techniques will be discussed. E.g. 
coverage and nodes heterogeneity problem in LEACH protocol, 
network-life problem in PEGASIS, some hybrid techniques will 
be discussed through which more energy consumption can be 
optimized along with performance improvement of other 
parameters like convergence time, scalability etc. The 
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classification of routing protocols in WSNs are shown in fig 1 
and 2. 



Fig: 1. Overall Routing Techniques in WSNs 



Fig: 2. Hierarchical Routing Rechniques 


DIFFERENT HEIRARCHICAL ROUTING TECHNIQUES 
1 . Cluster-based Routing 

In roting protocols of this category, the nodes form groups, with 
each group having a master node called cluster Head i.e. CH 
which is responsible for collecting data from all its group 
members, aggregate it and send it to the sink node where as in 
common method, the nodes either transfer the data directly to 
sink or send to other nodes along the path to the sink. This 
clustering has an advantage of less range of transmissions as 


nodes does no communicate with a distant Base Station (BS) or 
sink node but they communicate with their respective CH which 
in turn sends data to the sink, Thus reducing the length of 
transmissions for most of the node. The following are some of 
the widely used cluster-based protocols used in WSNs. 

1.1 LEACH (Low-energy adaptive clustering hierarchy) and 
LEACH-C 

This is the most popular of all hierarchical routing algorithms. 
In this protocol, the nodes form a cluster with a master node as 
described above. But if a node remains master node for a long 
time, its battery will deplete very quickie which will reduce 
network life, for this purpose, the CH is changed randomly for 
each round. A round means when a CH collects data from all 
nodes and send it to BS. In this manner, the power consumption 
is distributed and balanced. It minimizes energy efficiency by 
reducing transmission range as the node communicate with CH, 
instead of directly communicating with BS. Similarly data is 
aggregated which results in more energy optimization. The 
process of formation of cluster and CH selection is carried out in 
the following manner. When a round starts, every node compete 
for CH. CH is selected on certain criteria, which is (i) how many 
CHs are required for this system i.e. how many groups need to 
be created (ii) how many time a node has been selected as CH 
before. For this, the nodes use the following equation. 

_ I P/1 — P ( r mod I) if n € G 
( 0 , other wise 

Where G represents all nodes, each G chooses 1 or 0, if the 
selected number is less the above threshold, it is selected as CH. 
After selection of CH, the nodes connect themselves to the 
nearest CH, depending on the signal power of CH and thus forms 
a cluster. This s technique has the benefits of less range of 
transmissions, load balancing and data aggregation, however it 
suffers from some problems i.e. in nodes distribution, if one CH 
has more nodes than the other, this CH will consume more 
energy comparatively. For this, the clusters need to be almost 
uniform with respect to number of nodes. This problem can be 
solved easily for applications with fixed topologies but for other 
applications, it is comparatively difficult to make the clusters 
uniform. Another problem is, when there are CHs very near to 
each other i.e. coverage problem, if these two cluster act as a 
single cluster, they will consume less energy as compared to two 
clusters, in other words, we can say that the CH should have 
some minimum distance in order to create minimum clusters in 
the system. For above given problems, some possible solutions 
are discussed in section 3 of this paper. 

A variation of LEACH is LEACH-C (Centralized LEACH) 
which solves the problem of CH selection and coverage in its 
own way. The BS is selected by BS in the start. This is a 
proactive approach for topology creation. In the start, BS 
receives location and residual energy information of all nodes. 
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BS calculates average energy for a set of nodes and the node 
having lower energy than the average is removed from 
nominated list of CHs. BS uses annealing algorithm for cluster 
formation y minimizing sum of squared distance between nodes 
and CH. As in LEACH and LEACH-C, CHs have to 
communicate with all of the nodes, so they use TDM A scheme 
for communication with all of the nodes [2] . 

1.2 TEEN ( Threshold sensitive energy efficient protocols ) and 
APTEEN (Adaptive TEEN) 

This is another cluster-based routing protocol and forms cluster 
by the mechanism mentioned in LEACH. But unlike LEACH 
and LEACH-C, this is a reactive protocol. The nodes in this 
technique are almost in sleep mode and whenever there is a 
sudden change in the parameters of sensed data, the nodes 
become active and collect data. TEEN and APTEEN uses some 
threshold parameters to make the nodes active. Two types of 
thresholds are used, hard threshold, in which there is a specific 
limit for the value change and when that change occurs, the node 
must become active. Another is soft threshold, in which a little 
change in value attributes cause the nodes to become active. The 
threshold attributes are provided to all nodes by the CH once it 
is selected [2] . 

TEEN is having a problem, as TEEN only send data whenever 
any of the above threshold is met, otherwise the nodes will 
remain in the state of sleep and there will be no information about 
network condition or topology. Similarly, one cannot find out if 
the system is up or not. This problem was solved in APTEEN 
which uses the above thresholds for node activation, as well as it 
periodically sends data to the BS, giving a snapshot of the 
network. In this manner, it works as a proactive as well as in 
reactive manner. TEEN and APTEEN also uses TDMA just like 
LEACH and LEACH-C. 

TEEN and APTEEN can be combined used with some other 
protocols and performance of multiple parameters can be 
achieved [2]. Some suggestions are given in section 3 of the 
paper. 

1.3 Energy Aware routing Protocol (EAP) 

This technique provides another way for CH selection and area 
coverage. Its uses address tables with each node to know about 
their neighbors. Every node broadcast E-message to other nodes 
and calculate distance from each other. Apart from this, the 
residual energy is also considered before the selection of CH. 
This method uses inter-cluster and intra-cluster information to 
achieve better energy optimization. 

2. Chain-Based Routing 

In Chain-Based Routing, the nodes form chain or chains in the 
network with a leader node at random position for each round. 
The data from nodes travels along the chain to the leader node 
which aggregate and sends the data to BS or sink node. The main 
advantages of this topology is that its topology is very easy to 


construct, similarly, as the data is transmitted to neighbor nodes. 
So transmission distance is less, thus consuming less energy. The 
main drawback in chain-based routing is that if a single node in 
the chain fails, the whole network fails. Following are the some 
chain-based routing techniques with some problems mentioned. 

2.1 PEGASIS (Power Efficient Gathering in Sensor Information 
System): 

In this technique, chain is formed and data travels along the 
chain as mentioned above. The leader nodes position changes for 
each round, so energy consumption is divided among all the 
nodes. However, PEGASIS is suffer from the following 
problems: 

(i) What if a node with lesser energy remaining is selected as 
leader node, (ii) If a leader node with maximum distance from 
BS is selected, causing delay [4]. Selection of node with 
minimum residual energy may reduce network life. Some of the 
possible solutions are proposed for above problems in section 3. 
A hybrid technique is also proposed to increase network life time 
and consume lesser energy. For leader nodes, it is necessary to 
communicate directly with BS. So PEGASIS is more suitable for 
applications with fixed topology. 

2.2 Concentric Clustering Scheme (CCS) 

The topology of CCS is incredibly good and it also solves many 
problems of PEGASIS in its own way. CCS forms logical 
circular chains around BS, just like orbits around a nucleus of an 
atom. Each chain has a cluster head (CH), which gathers data 
from its member nodes. The nearest chain to BS is called level 1 
chain, 2 nd is level 2 and so on. The distance problem between CH 
and BS in PEGASIS is solved in a way that the CH of higher 
level sends data to the lower level’s CH and so on. Data to BS is 
transmitted by the nearest CH, thus consuming less energy. 
However, it will also suffer a great transmission delay as data 
travels from higher level CH to lower level CH and then to BS. 
Also the nodes are nearest to BS suffers more energy depletion, 
as the whole data goes through these nodes to BS. 

2.3 Energy-Balanced Chain Cluster Routing (EBCRP) 

In this routing technique, the nodes are divided into rectangular 
clusters, the nodes in each rectangle or clusters form chain with 
CH. This CH collects data from all nodes and send it to BS. The 
energy is balanced in such a way that each CH will remain CH 
until all of its energy is depleted. 

This technique has three phase, (i) Cluster formation (through 
ladder algorithm) (ii) CH selection and (iii ) The steady state 
phase. Once a CH is selected and remains CH until it dies, this 
state is called steady state phase. After steady state phase, a new 
round is started and another node become CH. 

This technique is suffer from transmission delay because of two 
reasons, if there are nodes (successive) far from each other. The 
other reason is the direct communication between CH and BS. A 
possible hybrid solution for this problem is given in section 3. 
(Multi-layered just like CCS). 
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2.4 Chain-based Hierarchical Routing Protocol ( CHIRON ) 

It consists of four phases (i) group creation in which nodes are 
divided into fan-shaped areas. BS has the information about all 
the nodes and their residing groups (ii) chain -formation, this is 
done as follows, the farthest chain from BS is selected as start of 
the chain and the successive node connect to it and the successive 
node to second node attach to the next node and so on, creating 
a chain of nodes in each group (iii) leader selection, in the start, 
the farthest node is selected as chain leader, and then in the next 
round, the node with high residual energy is selected as chain 
leader, (iv) This is the data transmission phase, the data is first 
sent to chain leader which then forwards it to the next chain 
leader and then to the BS. In this way, CHIRON follows multi- 
hop and short haul transmission because the aggregated data 
flows through several chain leaders up to the BS. However, it the 
chain leader nearest to the BS will suffer from quick energy 
depletion and that is the drawback of CHIRON. The groups are 
also divided uneven which cause uneven energy consumption 
resulting in reducing network life. 

3. Tree-Based hierarchical routing protocols 
The topology of the nodes in this category form a tree like 
structure (logical). Data from leaf nodes goes up to parent nodes 
and then to their parents and son up to the root. This reduces data 
flooding and unnecessary data retransmissions as the data 
follows the same route. The energy consumption is minimized in 
a way that there is no such long distance communication as data 
is transmitted to immediate neighbors as in chain-based 
protocols. However this method also suffers from some 
drawbacks i.e. the topology is not robust, as data follows the 
same route or there is a single path from one node to root node. 
If a node in the path crashes, the overall network topology will 
have to be changed which will increase convergence time. It also 
suffers from uneven energy consumption and scalability 
problems. Some of tree -based hierarchical routing protocols are 
discussed below. 

3.1 EADAT (Energy -Aware data Aggregation Tree): 

The technique focuses on the issue of energy consumption 
considering energy aware data centric routing. The tree is created 
keeping the residual energy of every node in consideration. The 
tree construction is performed in the following manner, initially 
the sink node, which is also considered as root node, sends 
control message to every other node. Each node has a timer 
associated with it, lower the value of the timer, higher its residual 
energy is. A leaf node selects a node with higher residual energy 
and its shortest path to the sink and make that node its parent. 
This process continues up to sink until a full tree is constructed. 
When a node’s energy becomes less than some pre -determined 
threshold, it broadcasts help message to all other nodes and the 
shutdown, now the nodes attaches to it either as parents or child 
will create a new topology according to above mechanism [9]. 


The protocol is data centric in a way that the nodes with higher 
residual energy and shortest paths become more responsible and 
more data flows through them as compared to others, this 
increases network life as well as energy consumption is even 
around the system. But the mechanism described above for 
creating tree sometimes create a longer path than the real 
minimum path which causes transmission delay as well as more 
energy is consumed. 

3.2 BAT (Balance Aggregation Tree) 

In this technique, a balanced tree according to energy 
consumption of each node. This is created in such a way that 
initially the BS is considered as root node and it is assumed that 
it has information about the positions of all nodes. In the start, a 
minimum weighted edge is selected and as much as child nodes 
are connected to it, the new node is connected to tree, if a 
neighbor node is found, the node is called leaf node. In short, this 
technique creates a minimum spanning tree in which weight 
parameter for edges is the “energy dissipation” as cost. 

This technique achieves a good energy efficiency, however it 
does not consider the residual energy of nodes while creating 
tree, which can reduce network life [10]. 

3.3 ETR (Enhanced Routing tree) 

This a modified and enhanced version of TR (Tree Routing) 
whose objective is to introduce balance between performance 
and cost.in this technique, a minimum cost path is created up to 
the sink for each node. This is done by using a table called 
“Neigh our Information Table” which has addresses of 
successive neighbors. Each node has this updated table. For path 
selection, ETR uses a parameter called “Network Depth” which 
is the hop-count from one node to the sink or sink to node. The 
value of “Network Depth’ is 0 for root node and for the rest, it 
increases as a node comes in path. ETR selects shortest path i.e. 
minimum network depth value for each node to the sink. This 
protocol has many advantages as less range of transmission, 
shortest path selection but just like BATR, it also ignores 
residual energy which is great drawback in ETR. 

4. Grid-Based Routing Protocols 

In grid-based routing, the nodes the nodes are dispersed in given 
area on the basis of some geographical constraints that is why 
this is also called location-aware routing. 

The advantages of this category contain efficient data delivery as 
each node has a deterministic set of nodes to which it transfers 
data. However it suffers from load balancing as there may be 
more data in one grid than the other. Another problem is almost 
fixed routing is used in this type of routing, having no alternate 
routes, ignoring traffic or load considerations. Some of grid- 
based routing protocols are discussed below. 

4.1 PANEL (position-based aggregator node election) 

As its name suggests, it selects some data aggregators on the 
basis of some position information criteria. The nodes are 
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divided into geographical clusters. An aggregator is selected for 
each cell, with respect to the lower left corner of the cluster. The 
communication formats re of two type, inter-cluster in which 
data is sent to a single cluster closer to BS and intra-cluster in 
which data is sent or aggregated to an aggregator and then the 
data is sent from aggregator to sink or BS. 

PANEL provides load balancing as aggregators are changed after 
sometime or after each round. However the selection of 
aggregators based on geographical locations needs extra 
complex technology on both hardware and software side, thus 
may be considered cost-effective. 

4.2 TTDD (Two -Tier Data Dissemination ) 

In this approach, nodes are divided into different grids with 
several dissemination nodes for spreading or sending queries to 
source node. This approach also has multiple mobile sinks. 
When a sink needs some data, it sends queries to dissemination 
nodes which forward queries to source nodes. The sink can move 
from one grid to another and broadcasts queries to all nodes of 
the grid. When a sink moves from on grid to another, it selects a 
reference node called bridge node which forward data to sink 
which has moved to another cell or grid. This is a good technique 
for event-driven and on-demand data applications [11]. 

4.3 HGMR ( hierarchical geographic multicast Routing) 

This protocol is a hybrid of GMR and HRPM. The objective of 
GMR is to enhance forwarding while that of HRPM is reducing 
encoding overhead. HRPM divides nodes into cells with an AP 
(Access Point) which has destination information about all nodes 
in the cell. The Aps are managed by a rendezvous point. Two 
types of trees are constructed for communication, source-to-AP 
and AP-to-member tree. The data is transferred to different Aps 
at different levels until it reaches the lower level AP which is 
then forwarded to BS or sink. In HGMR, the nodes are given 
different tasks with different responsibilities and loads, thus less 
energy is consumed. 

5. Area based Hierarchical routing protocols 

In this type of routing, some of the nodes are selected as master 
nodes or act as high tier node. These are responsible for 
collecting data from other nodes and to forward data to sink 
node, this is a useful approach for mobile WSNs. The advantages 
of area based routing is that a specific area is selected so topology 
implementation is easy comparatively and high tier node can be 
selected very easily. Another advantage is that just like other 
cluster and chain topologies, the data is transmitted locally 
avoiding large distance communication, thus less energy is 
consumed. However, this approach also suffers from some 
drawbacks which include: scalability because for large reason 
the data broadcast may result in high energy consumption. As 
this approach is largely used for mobile WSNs, so cost is high 
for their implementation because some extra technologies are 
needed for the deployment. 


The following are some of the routing techniques based on this 
category. 

5.1 LBDD ( Line -based Data Dissemination) 

In this method, the nodes are divided into two parts creating a 
fence called vertical strip or line of nodes. This line acts as a 
storage area and all the data is sent to this line or inline nodes 
before sending it to the sink the nodes within the area of line are 
called inline nodes. The operation is performed in two phases, in 
first phase, a node generates new data and send it to the nearest 
inline node. In the second phase, the sinks sends query to the 
vertical strip and the query is flooded to all the nodes of the strip, 
upon receiving the query, the nodes having some data received 
from other nodes send data to the sink node [12]. This technique 
experiences the problem of load balancing because the strip 
nodes are responsible for data transmission, if there are less 
number of nodes on the strip line, they will deplete energy very 
quickly, resulting in reduced network life [14]. 

5.2 VLDD ( Virtual Line -based Data Dissemination) 

This topology works on Virtual Line Structure (VLS). This is a 
specific region with nodes in a chain form. The data is gathered 
on this line and then sent to sink. If a node wants to send data to 
VLS, it calculates entry point or node to VLS, the shortest path 
is selected and data is sent to that node, this data is transferred to 
the neighbor node of VLS until it reaches the exit point. Now the 
sink sends query for data to VLS, unlike LBDD in which query 
is broadcast to all nodes of strip line, the query is sent to exit 
point of VLS. If VLS has data on its exit point, it notifies the sink 
by create a flag with value = false, otherwise the data is taken 
from other nodes of VLS and sent to sink [13]. 

The techniques provides good energy efficiency by avoiding 
flooding, however the exit point of VLS may suffer more from 
energy depletion. 

5.3 Ring Routing 

In ring topology, nodes form a ring for collection of data from 
nodes and transmission of data to sink. After the formation of 
nodes, the neighbor nodes attach to the ring and transmit data to 
different nodes in the ring. The ring nodes change time to time, 
therefore the problem of network failure is minimized. The ring 
acts as a rendezvous for the tasks and 

Execution queries. The sink gets data from the ring by sending 
queries and its location. The topology implementation also easy 
in ring topology just like in LBDD [14]. 

5.4 Railroad: 

This is a proactive technique in which a topology is created with 
a specific area which contains Meta data for the actual data. This 
is called rail and is located in the middle of the network. 
Whenever a query is generated by the sink for data, it is sent to 
rail, the rail looks up for Meta data of that query and informs the 
source node, the source node delivers that data to sink node. The 
difference between LBDD and railroad is that sin LBDD, the 
sink node sends data to all nodes of strip line while in railroad, it 
unicasts the query [60] . 
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COMPARATIVE ANALYSIS OF HIERARCHICAL 
ROUTING PROTOCOLS AGAINST SOME 
PARAMETERS 

In this section, the above categories and their mention techniques 
will be analyzed and compared on the basis of the following 
parameters: Energy Efficiency, Transmission Delay, Scalability, 
Load balancing, Network Life, Data aggregation and criteria for 
CH selection. Each technique follows a different topology and 
mechanism. One topology favors scalability but suffers network 
life problem, some topologies provides better data aggregation 
while others are good for convergence time. This section will 
give us a general and broad idea about how a topology achieves 
certain performance parameters and how it suffers from some 
drawbacks. In the next section, keeping this discussion in view, 
we will suggest a suitable topology or technique for specific 
application areas. This section will discuss the performance 
evaluation on the basis of above of parameters category -wise. 
Data aggregation is common to almost all of the protocols, as 
data is gathered by a single or multiple master nodes and then 
sent to sink. The performance analysis is peresented in the 
form of table in Table 1. 

Cluster-Based Routing protocols : The network topology of 
cluster-based routing i.e. LEACH, LEACH_C, TEEN and 
APTEEN suggests that the nodes can be dispersed in a wide area 
so scalability can be achieved efficiently, because the group 
formation can increase the area for network. However, as the 
farthest CH may cause transmission delay, so there should be a 
limit regarding scalability [3], but normally data delivery is good 
in cluster routing, Energy efficiency is also good for these 
protocols but lesser as compared than those of tree and chain 
based topologies. The BS is almost fixed for all pf these 
protocols. Load-balancing is good. Network life is enhanced by 
considering residual energy, similarly convergence time for 
cluster-based routing is very little which is a very good feature 

[41 

Chain-Based Routing Protocols: energy efficiency is very Low 
for chain-based as compared to others. This is because of the 
reason that sometimes the CH is too far from the BS that it has 
to consume more of the energy to transmit data to BS. But as 
CHIRON uses multi-layered mecahnism for data transmission, 
so it has comparatively good energy efficiency as compared to 
other chain-based protocols. Scalability in chain-based protocols 
is very less because the nodes form long chains, so it is not 
scalable to a large extent.chain-based protocols suffer greatly 
from the problem of transmission delay because of long chains 
as well as multiple level of chains, so data delivery rate is high 
for these protocols, even CHIRON suffers from this problem. 
Load balancing is good because the data transmission burden is 
transferred to each node in turn. The algorithm complexity of 
PEGASIS is more because it acquires the global knowledge of 
all nodes while, algorithm comlexity for CHIRON and others is 
less comparatively as they do not need global information of all 
nodes. Data aggreagtion in almost all types of chain -based 
protocols. 

Tree-based routing protocols: Energy efficiency for is better 
compared to chain and grid based because the transmission 


distance is very little, as the nodes communicate with immediate 
neighbors, however it may result in delayed data transmission 
from leaf node to root node if the leaf node is too far from sink 
node. Load balancing is also greatly achieved because in the tree 
construction, the nodes with more residual energy has more 
responsibility for delivery of data up to the sink. The data in tree- 
based protocols travel through so many nodes causing delay, so 
it also limits the scalability of the network. Network life is also 
good because load is balanced across the network. Tree-based 
topology suffers from data delivery time if data has to be sent 
from leaf node to sink or root node. In some techniques of this 
category, spanning tree is created, which makes its algorithm 
complex as compared to others. Convergence time varies and 
depends upon the node’s level, convergence time is greater for 
nodes of levels nearer to the root node. 

Grid-based Routing Protocols: As the topolgy of grid-based is 
multi-hop or multi-layered, the energy efficiency is good but 
lesser than tree and area based routing techniques. In this 
techniques, aggregator s are responsible for data gathering and 
transferring and each node has equal chances to become 
aggregator, so the load is balanced in the system but some of the 
techniques suffer from imabalnced load problem, so the overall 
load balacnig can be considered moderate. Network life is also 
moderate. Scalability is good because the datapassed through 
less number of hopes, favoring scalability. Trnasmission delay is 
moderate, the farther nodes suffer from great delay as data has to 
travel through many levels. Convergence time is good as 
compared to tree and area-based routing. Cluster and grid-based 
routing protocols are almostequal in convergence time of 
network. 

Area-based routing techniques: The Topology of area-based 
routing is almost the same as grid-based routings. Area-based 
routing achieves certain performance parameters, these are 
discussed as: area-based routing performs flooding which results 
in large energy consumption, but this can be minimized if the 
network is not too much large. Load is imbalanced in the system 
because the nodes on the line strip have more responsibility as 
compared to others. However, railroad and ring protocols 
achieve some load balancing. Area-based routing suffers alot 
from the problem of transmission delay because of 
flooding. algorithm comlexity is low as compared to other 
categories. Convergence time is less because lesser nodes are 
dependent upon each other. 
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Category 

Protocol 

Data 

aggregation 

[4] 

Energy 

efficiency[4] 

Scalability 

[4] 

CH selection 
[5] 

Load 

balancing 

[4] 

Complexity 

[4] 

Network 

life 

Convergence 

Time 

C 

L 

U 

S 

T 

E 

R 

LEACH 

yes 

moderate 

good 

proactive 

good 

moderate 

moderate 

modertae 

LEACH-C 

Yes 

Modertae 

Good 

Proactive 

Good 

high 

Moderate 

Good 

TEEN 

Yes 

Good 

Good 

Proactive & 
Reactive 

Good 

Moderate 

Good 

Moderate 

REAP 

Yes 

Moderate 

Good 

Proactive & 
Rective 

Good 

High 

Moderate 

Moderate 

C 

H 

A 

I 

N 

PEGASIS 

Yes 

Very low 

Very low 

proacive 

Moderate 

High 

Moderate 

Less 

CCS 

Yes 

Very low 

low 

Proactive 

Bad 

Moderate 

Moderate 

Less 

EBCRP 

Yes 

Very low 

low 

Proactive 

Moderate 

Moderate 

Good 

Less 

CHIRON 

Yes 

Moderate 

Low 

Reactive 

Moderate 

Moderate 

Good 

Less 

T 

R 

E 

E 

EADAT 

Yes 

Moderate 

Low 

Proactive 

Moderate 

Low 

Moderate 

Moderate 

BATR 

Yes 

Low 

Low 

Proactive 

Bad 

Moderate 

Moderate 

Moderate 

ETR 

Yes 

Moderate 

Moderate 

Reactive 

Bad 

Low 

Low 

Moderate 

G 

R 

I 

D 

PANEL 

Yes 

Moderate 

Moderate 

Reactive 

Good 

High 

Moderate 

Good 

TTDD 

Yes 

Very low 

Moderate 

Reactive 

Good 

Low 

Moderate 

Moderate 

HGMR 

Yes 

low 

High 

Proactive 

Bad 

Low 

Moderate 

Good 

A 

R 

E 

A 

LBDD 

Yes 

Moderate 

Moderate 

Reactive 

Low 

Moderate 

Low 

Good 

Ring 

Yes 

Moderate 

Moderate 

Proactive 

Good 

Moderate 

Low 

Good 

RAILROA 

D 

Yes 

Moderate 

Moderate 

Proactive 

Good 

Moderate 

Moderate 

Moderate 

VLDD 

Yes 

Moderate 

Moderate 

Proactive 

low 

Moderate 

low 

good 


TABLE 1 . PERFORMANCE ANALYSIS OF HIERARCHICAL ROUTING PROTOCOLS 
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II. DISCUSSION 

The above discussion and analysis can help us to determine 
which technique is most suitable for a certain domain of 
applications. This section is composed of two main points: (i) 
which of the routing category achieve most performance 
features, this can help us understand the importance and use of 
each category and (ii) on the basis of discussion in section 3, 
we will determine domain of applications for each technique. 
By analyzing the above discussion and given tables, we can 
see that area-based routing can achieve more performance 
parameters as compared to others for small networks, however 
it suffers from scalability and energy efficiency issues a lot in 
large networks. Tree and grid-based algorithms lie almost on 
the same line regarding performance parameters achievement. 
Cluster-based routing has poor performance achievements 
comparatively and the reasons are given in the discussion of 
section 3. 

Cluster-based routing is suitable for small-scale applications 
like agriculture because it is scalable up to some limit. Chain- 
based routing protocols are suitable for short-range networks 
such as health monitoring and Bluetooth applications. Tree- 
based routing can be efficiently used in small-scale networks 
and data-centric networks like smart home applications. Grid- 
based routing protocols are best for QoS and multicast 
applications. Area-based routing protocols use mobile sink 
nodes so they can be best utilized for large area networks. 
Here are some of the trade-offs between some performance 
parameters. Scalability normally suffers when data has to pass 
through multiple tiers and is also suffered form multiple hops. 
Similarly, load-balancing increases network life. The more the 
load is balanced among the nodes, the lesser network fails. 

III. FUTURE DIRECTIOS 

WSNs have been evolving since last decade and research has 
been going on in its different domains as was discussed in first 
section. WSNs are now collaborating the newly evolving 
technology “Internet of Things” and it can provide many 
challenges for the researchers to make both the technologies 
more efficient and useful for users. Similarly, work can also 
be done on different problems in above discussed routing 
protocols e.g. LEACH protocol has the problem of coverage 
and uniformity in clusters creation, this can be solved if there 
is determined some minimal distance between cluster heads, 
this will also result in more energy optimization because we 
have concluded that the lesser the CHs, the more is energy 
optimization. Similarly hybrids of different protocols can be 
proposed in order to achieve multiple performance parameters 
e.g. LEACH and REAP. QoS applications are rapidly 
increasing, so there is a need to work on this performance 
parameter. Another challenge is mobile sinks. This area also 
has lots of challenges for researchers. 

IV. CONCLUSION 

Above was a brief survey about WSN energy efficient routing 
protocols. Work can be done in different areas of WSN. 


Routing is also considered to be a major area which can effect 
energy utilization. The paper will provide some opportunity to 
grab basic knowledge about different routing techniques and 
future challenges. 
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Improved Face Recognition Rate Using 
Face Partitioning in Eigen 
And Fisher Feature Based Algorithms 


Harihara Santosh Dadi, Gopala Krishna Mohan Pillutla 


Abstract — Face partitioning technique is presented in this 
paper. Instead of directly giving the face to the face recognition 
system, first the face is partitioned in to different face parts using 
face partitioning technique. The face parts are namely mouth, 
left eye, right eye, head, eye pair and nose. Eigen and Fisher 
features based algorithms are considered for experimental 
purpose. These face part features are given to the SVD classifiers 
individually. The outputs of the classifiers are again given to the 
decision making algorithm. Based on the maximum likely hood 
principle, this decision making algorithm outputs a face. ORL 
data base is used for evaluating the performance of this new 
technique. The first two faces of all the 40 people in the data base 
are considered for testing and the remaining eight faces are used 
for training purpose. Results are separately calculated with and 
without face partitioning technique. Results show that face 
recognition rate is increased by using the combination of face 
partitioning technique and basic face recognition algorithm. The 
new algorithm is also verified on 8 different data sets. 
Experimental results show that this face partitioning is 
improving the face recognition rate both Eigen and Fisher 
feature based algorithms. 

Index Terms — Face Partitioning, Facial features, Recognition 
engine, Support Vector Machine, Decision making algorithm. 

I. INTRODUCTION 

Face recognition aims at identifying the person’s 
distinctiveness by comparing the facial features with the 
available face data base features. The face data base, with 
known characteristics, is referred as the face gallery and the 
input face requiring determining the identity is the probe. One 
of the problems in face recognition is identification, and the 
other is the authentication (or verification). Of the two, face 
identification is more tricky as it cross verifies the gallery 
completely for minimum variance. 

Numerous algorithms are developed on face recognition 
particularly in the last two to three decades. Improving the 
Face recognition rate is always the challenge ever since the 
first algorithm was developed. In 1991, Alex Pentland and 
Matthew Turk [1] applied Principal Component Analysis 
(PC A) which was invented in 1901 to face classification. This 
has become the standard known as the Eigen face method and 
is today an inspiration for all face recognition algorithms 
evolved. Nan Deng et. al. [2] introduced face recognition 


algorithm for occluded faces. This method is based on 
dictionary learning for sparse representation and sub classifier 
fusion (LSSRC). The advantage of this method is its ability to 
conduct fusion recognition based on different identification 
contributions of sub-classifiers. For more robust face 
recognition algorithms refer [18] - [19]. Le An et. al. [3] 
introduced face recognition in multi camera surveillance 
videos. They developed unified face image (UFI) by fusing 
face image from different cameras. This is more effective as it 
uses multi cameras for face feature extraction from different 
orientations. This algorithm needs a high experimental setup. 

As the facial features are more localized, the algorithms are 
becoming more insensitive to the common challenges like 
facial expressions, occlusions, illumination and pose 
variations. This is the inspiring force for us to develop this 
algorithm. 

In this paper we propose a novel approach for developing 
face recognition algorithm. Here, we divided the face in to 
face parts like head, nose, right eye, mouth, left eye, and eye 
pair. Paul Viola et. al. developed face parts detection 
algorithms [4] - [6]. The features like Eigen are extracted for 
these parts and given to the classifiers. The classifier compares 
the features of the probe and the features of the gallery in the 
database. Each classifier outputs the face part. All these face 
parts are again given to the decision making algorithm which 
finally generates the matched face. We compare our 
algorithm’s results with Eigen face feature algorithm. Finally 
we compare our algorithm with the standard face recognition 
algorithm, PCA. 

While numerous face recognition algorithms are being 
developed, the authors are comparing them with the existing 
ones very superficially and few simple comparisons are 
presented. Given that large set of techniques and the theories 
that are applicable for face recognition, it is evident that the 
detailed analysis and bench marking these algorithms is very 
crucial. Effort done by Universities and research laboratories 
in developing the data sets pushed the comparisons of face 
recognition algorithms to the higher level. CMC and ROC 
curves were introduced for comparisons. Apart from finding 
the recognition rate, these curves become the basis for 
showing the superiority of the author’s developed algorithms. 
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The contributions of this paper are as follows: 

• We develop a novel face partitioning algorithm 
based on localizing the facial features. This works 
well for finding out the face parts and is more 
insensitive to the illumination, pose and facial 
expression variations. As the features are more 
localized, the variations become substantially 
reduced when we see for individual face parts. 

• We presented a decision making algorithm which 
accepts different face part outputs from different 
classifiers and generates a face output. 

• Extensive comparisons are made by taking the 
performance metrics curves namely CMC and 
ROC and showed that the curves are effective for 
proposed algorithm compared with Eigen and 
Fisher feature based algorithms. 

The remainder of this work is prepared as follows. Section 
II reminds the related work. Section III presents methodology 
of extraction of Eigen features and about SVD classifier. 
Section IV Face partitioning algorithm is presented. Section V 
shows the experimental results. Conclusions are finally stated 
in Section VI. 


II. Related Work 

Face recognition methods mainly deal with images which 
are of large dimensions. This makes the task of recognition 
very difficult. Dimensionality reduction is a concept which is 
introduced for the purpose of reducing the image dimensions. 
PCA is the most widely used dimensionality reduction and 
also for subspace projection. PCA can supply the client with a 
lower-dimensional picture, a projection of this object when 
seen from its informative view point. This can be achieved by 
taking only the starting few principal components in such a 
way that the dimension of the transformed data is minimized. 
The linear combination of pixel values here in PCA are called 
Eigen faces. 

Two performance metrics curves are considered. 
Cumulative Match Score Curves (CMC) is the curve between 
the rank on the x-axis and face recognition rate on the y-axis. 
Receiver Operating Characteristics (ROC) is the graph 
between false acceptance rate and verification rate. ROC 
curves are more informative 

III. Face recognition algorithm 

A typical face recognition algorithm is presented in this 
section. For any face recognition algorithm, there are two 
phases. One is training phase and the other is the testing phase. 
In the training phase, the features of all the faces in the gallery 
are found and stored in the data base. Eigen features are taken 
in the sample face recognition algorithm shown below in the 
figure 1. In the testing phase, the features of the probe are 
calculated. These features and the features of the gallery are 


given to any of the classifier. SVD classifier is taken as 
example in the figure. The Eigen features of the probe and the 
Gallery are taken by the SVD. The classifier looks for the 
closest feature matching face from the gallery with the probe 
and gives that face as output. Figure 1 shows the sample face 
recognition algorithm block diagram. 



Output 
Face '12' 


Fig.l existing face recognition system 


IV. Proposed Method 
A. Face partitioning Algorithm : 


The Face image is partitioned in to different parts like Eye 
pair, mouth, left eye, nose, right eye and head. The face data 
set is divided in to data sets of these features. Each data set is 
divided in to testing and training sets. The Eigen features are 
found for the training data set. These features are given to the 
SVD classifier. In the testing phase the testing image is also 
partitioned in to image features mentioned above. Each part of 
the face is given to the corresponding classifier. The SVD 
classifier generates the output image for which the Eigen 
features are closely matched. All the classifiers generate 
different types of face images as outputs. All these are given 
as inputs to the decision making algorithm. In this stage, the 
optimal face is detected and given as output. 

1) Partitioning of Faces 

The images in the face data set are divided in to face 
features. Figure 2 shows the first faces of all the 40 members 
in the ORL database. Figure 3 shows the block diagram of the 
face partitioning algorithm used in the training phase. Here, 
each face image is divided into different parts. Figure 4 shows 
how the face parts are detected. Head in red, right eye in 
magenta, nose in blue, left eye in black, mouth in purple and 
eye pair in green. Figure 5, 6, 7, 8, 9 and 10 are the gallery of 
heads, mouths, eye pairs, left eyes, noses and right eyes of the 
first image of all the persons in the ORL database respectively. 
Figure 11 shows all the 10 face images of the first person in 
the ORL database. Figure 12 shows how the face parts are 
detected. Figure 13 shows the block diagram of the face 
partitioning algorithm used in the testing phase. Figure 14, 15, 
16, 17, 18 and 19 are the gallery of heads, mouths, eye pairs, 
left eyes, noses and right eyes of the all the ten images of the 
first person in the ORL database respectively. 
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2 ) Gallery images 



Fig. 2. First face image of all 40 people in the ORL database 



Eye Pair 

Fig. 3. Face partitioning algorithm in the training phase which partitions the 
face into head, right eye, mouth, nose left eye and eye pair. 



Fig. 4. Face parts shown in different colors. Head in red, right eye in magenta, 
nose in blue, left eye in black, mouth in purple and eye pair in green. 


a) Head Images 



Fig. 5. Head parts of all 40 people from ORL database. 


b) Mouth Images 



Fig. 6. Mouth parts of all 40 people from ORL database. 


c ) Eye Pai r Images 



Fig. 7. Eye pair parts of all 40 people from ORL database. 
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d) Left Eye Images 



Fig. 8. Left eye parts of all 40 people from ORL database. 




Fig. 12. Face parts shown in different colors for the first person from ORL data 
base. 


f) Right Eye Images 



Fig. 10. Right eye parts of all 40 people from ORL database. 

3 ) Probe Image 

The face images of the first person in the AT&T Database. 



Fig. 13. Face partitioning algorithm which partitions the face image into 
different parts in the testing phase. 


a) Head Image 
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Fig. 14. Head parts of the first person in ORL database 
b) Mouth Image 



Fig. 15. Mouth parts of the first person in ORL database 
c ) Eye pai r Image 
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Fig. 16. Eye pair parts of the first person in ORL database 
d) Left eye Image 



Fig. 17. Left eye parts of the first person in ORL database 


e) Nose Image 



Fig. 18. Nose parts of the first person in ORL database 
f) Right eye Image 



Fig. 19. Right eye parts of the first person in ORL database 

4) T raining Phase 

In the training phase, all the face part gallery features are 
extracted and individually trained by using any classifier. Here 
we extracted Eigen features and used SVD classifier. Figure 
20 shows the training of all the face parts in the training phase. 
Figure 21 shows the overall training phase of our proposed 
method. This way of igniting the recognition engine is 
introduced in this section. 
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Fig. 20. Training phases of head, right eye, mouth, nose, left eye, and eye pair 
galleries. 
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Fig. 21. Overall training phase of the proposed algorithm 

5 ) Testing Phase 

In the testing phase, the probe image is given to the face 
partition algorithm. The Eigen features of the face parts are 
extracted individually. Figure 22 shows how the Eigen 
features of first image of first person in the ORL database are 
extracted in the testing phase. Figure 23 shows the overall 
testing phase of the proposed algorithm. Figure 24 shows the 
face partitioned face recognition algorithm. The training and 
the testing phases are separately shown. 
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Fig. 23. Overall testing phase of the proposed algorithm. 
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Fig. 24. Face Partitioned Face Recognition System testing and training phases. 
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Fig. 22. Feature extraction of face parts of the probe image in the testing 
phases. 


B. Decision making algorithm: 

Let there are ‘iT classifiers for different face partitioned 
datasets. Each classifier compares the features of the gallery 
and the features of the probe. The classifier outputs the nearest 
face part from the gallery with the probe. The input to the 
decision making algorithm are the outputs from the SVM 
classifiers. Decision making algorithm compares all the face 
parts. The face with more number of face parts is produced as 
output. Here in our algorithm the face with more than two face 
parts is considered as the output of the decision making 
algorithm. Figure 25 shows the complete face partitioned face 
recognition system. 

Algorithm Decision Making Algorithm (DMA) 

1 . Let the total number of persons in the gallery be ‘p\ 

2. Let ‘a’ be the head part from the gallery which is matched with 
the probe F PH , ’b’ be the left eye part from the gallery which is 
matched with the probe F PL , ‘c’ be the right eye part from the 
gallery which is matched with the probe F PR , ‘d’ be the eye pair 
part from the gallery which is matched with the probe F PE , ‘e’ 
be the nose part from the gallery which is matched with the 
probe F pn? and T be the mouth part from the gallery which is 
matched with the probe F PM . 
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3 . 


4. 

5. 

6. 

7. 

8 . 

9. 

10 . 


Let ‘a’ belong to Fj face in the gallery, ‘b’ belong to Fj, ‘c’ 
belong to F k , ‘d’ belong to Fi, ‘e’ belong to F m , T belong to F n . 
Where l<i,j,k,l,m,n<p. 

/^Equals function outputs the total number of equals with the 
first argument among the other arguments*/ 

(i,01)=Equals(i and j,k,l,m,n) 

(j,02)=Equals(j and i,k,l,m,n) 

(k,03)=Equals(k and i,j,l,m,n) 

(l,04)=Equals(l and i,j,k,m,n) 

(m,05)=Equals(m and i,j,k,l,n) 

(n,06)=Equals(n and i,j,k,l,m) 

Output face image=Max(01,02,03,04,05,06) l<Output<p. 

/* Max function generates that face which corresponds to the 
Maximum of 01 to 06 as output*/. 
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Fig. 25. Face Partitioned Face Recognition System. 


V. Experimental Results 

Experiments have been conducted on proposed algorithm 
by taking ORL AT&T data base [11]. For training phase the 
first eight face images are taken and for the testing purpose the 
last two face images are taken. The face database is first 
divided in to six separate databases namely head, eye pair, left 
eye, mouth right eye and nose. Eigen features are extracted in 
the training phase for all the data sets. And in the testing 
phase, the Eigen features of these parts are taken and given to 
the SVD classifiers parallel. The outputs of these classifiers 
are again feed to the decision making algorithm. The output of 
decision making algorithm is the output of the proposed 
system. 

The individual results are shown for both the test images in 
table I and table II. The results for different data sets are 
shown in different columns. The last column is the proposed 
method. The green color indicates that the proposed algorithm 
is generating the correct output where as the original PCA 
algorithm is giving the wrong result. The red color indicates 
that even by using the proposed algorithm still some of the 
face images are not showing the correct output. There are 80 
test images and 320 train images. Out of 80 test images, 64 
images are correctly recognized by the PCA algorithm. 
Whereas by using the proposed algorithm, 76 test images are 
correctly recognized. There is an improvement of 15% in face 
recognition rate when compared with the PCA algorithm on 
ORL database. 

Figure 26 and 27 shows the 3-D graphs between testing 
versus training of test face image 1 and 2 respectively. 
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Fig. 26. Testing Vs Training Datasets for Face 1 test image. 
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Fig. 27. Testing Vs Training Datasets for Face 2 test image. 

TABLE I 

Outputs of different face recognition algorithms for test face 1 


Face 1 

Original 

AT&T 

Database 

PCA 

Algorithm 

Eye pair 
Feature 

Face 

Feature 

Left Eye 
Feature 

Mouth 

Feature 

Nose 

Feature 

Right Eye 
Feature 

Proposed 

Algorithm 

SI 

SI 

SI 

SI 

S7 

SI 

SI 

SI 

SI 

S2 

S2 

S2 

S2 

S2 

S2 

S2 

S2 

S2 

S3 

S3 

S3 

S3 

S3 

S35 

S30 

S38 

S3 

S4 

S4 

S4 

S4 

S36 

S4 

S4 

S17 

S4 

S5 

S17 

S5 

S5 

S9 

S17 

S5 

S5 

S5 

S6 

S6 

S6 

S17 

S13 

S4 

S6 

S6 

S6 

S7 

S7 

S7 

S7 

S8 

S7 

S7 

S7 

S7 

S8 

S8 

S8 

S8 

S8 

S30 

S8 

S8 

S8 

S9 

S22 

S12 

S29 

S22 

S9 

S9 

S29 

S29 

S10 

S10 

S10 

S10 

S10 

S10 

S10 

S10 

S10 

Sll 

Sll 

S34 

Sll 

S15 

Sll 

S32 

S22 

Sll 

S12 

S12 

S12 

S12 

S12 

S12 

S12 

S40 

S12 

S13 

S40 

S13 

S13 

S13 

S40 

S13 

S25 

S13 

S14 

S14 

S14 

S14 

S14 

S28 

S20 

S14 

S14 

S15 

S15 

S15 

S15 

S15 

S29 

SI 

S40 

S15 

S16 

SI 

S28 

S28 

SI 

S16 

S16 

S20 

S16 

S17 

S17 

S36 

S17 

S17 

S17 

S17 

S17 

S17 

S18 

S18 

S18 

S18 

S18 

S18 

S18 

S18 

S18 
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S19 

S19 

S19 

S19 

S25 

S19 

S6 

S37 

S19 

S20 

S22 

S20 

S20 

S22 

S40 

S20 

S20 

S20 

S21 

S21 

S24 

S21 

S6 

S21 

S21 

S21 

S21 

S22 

S22 

SI 

S22 

S22 

S22 

S22 

S22 

S22 

S23 

S23 

S23 

S23 

S23 

S23 

S23 

S22 

S23 

S24 

S24 

S24 

S24 

S31 

S3 

S24 

S21 

S24 

S25 

S25 

S39 

S25 

S25 

S7 

S38 

S25 

S25 

S26 

S26 

S26 

S26 

S26 

S4 

S26 

S6 

S26 

S27 

S27 

S27 

S27 

S14 

S27 

S27 

S27 

S27 

S28 

S28 

S28 

S18 

S28 

S28 

S26 

S28 

S28 

S29 

S40 

S29 

S40 

S29 

S30 

S29 

S18 

S29 

S30 

S30 

S30 

S30 

S23 

S30 

S30 

S30 

S30 

S31 

S31 

S31 

S31 

S31 

S30 

S31 

S31 

S31 

S32 

S32 

S32 

S32 

S32 

S21 

S16 

S32 

S32 

S33 

S33 

S31 

S33 

S30 

S33 

S35 

S25 

S33 

S34 

SI 

S34 

SI 

S40 

S40 

S34 

S40 

S40 

S35 

S35 

S30 

S35 

S35 

S3 

S35 

S35 

S35 

S36 

S36 

S36 

S36 

S12 

S36 

S36 

S36 

S36 

S37 

S37 

S37 

S37 

S37 

S32 

S37 

S27 

S37 

S38 

S38 

S38 

S38 

S38 

S38 

S38 

S6 

S38 

S39 

S39 

S39 

S39 

S39 

S33 

S21 

S2 

S39 

S40 

S40 

S40 

S40 

S25 

S21 

S40 

S40 

S40 


TABLE II 

Outputs of different face recognition algorithms for test face 2 


Face 2 

Original 

AT&T 

Database 

PCA 

Algorithm 

Eye pair 
Feature 

Face 

Feature 

Left Eye 
Feature 

Mouth 

Feature 

Nose 

Feature 

Right Eye 
Feature 

Proposed 

Algorithm 

SI 

S40 

S40 

S40 

S7 

S36 

S15 

S16 

S40 

S2 

S2 

S2 

S2 

S7 

S2 

S2 

S2 

S2 

S3 

S3 

S10 

S3 

S40 

S3 

S3 

S10 

S3 

S4 

S4 

S4 

S4 

S8 

S4 

S4 

S31 

S4 

S5 

S5 

S5 

S5 

S29 

S5 

S5 

S5 

S5 

S6 

S6 

S37 

S6 

S13 

S6 

S6 

S23 

S6 

S7 

S7 

S7 

S7 

S7 

S7 

S7 

S7 

S7 

S8 

S8 

S16 

S39 

S8 

S8 

S8 

S16 

S8 

S9 

S9 

S9 

S9 

S35 

S9 

S9 

S9 

S9 

S10 

S40 

S24 

S10 

S10 

S10 

S38 

S15 

S10 

Sll 

Sll 

Sll 

Sll 

S28 

Sll 

Sll 

S3 

Sll 

S12 

S12 

S12 

S12 

S12 

S21 

S12 

S40 

S12 

S13 

S13 

S13 

S13 

S13 

S13 

S13 

S6 

S13 


S14 

S30 

S14 

S14 

S14 

S28 

S21 

S14 

S14 

S15 

S40 

S15 

S15 

S40 

S15 

S40 

S15 

S15 

S16 

S16 

S16 

S16 

S24 

S14 

S24 

S16 

S16 

S17 

S21 

S17 

S21 

S21 

S19 

S17 

S17 

S17 

S18 

S18 

S18 

S18 

S18 

S18 

S21 

S18 

S18 

S19 

S16 

S19 

S28 

S28 

S19 

S28 

S28 

S28 

S20 

S20 

S20 

S20 

S20 

S40 

S4 

S20 

S20 

S21 

S21 

S12 

S21 

S31 

S30 

S21 

S16 

S21 

S22 

S22 

SI 

S22 

S22 

S22 

S22 

S22 

S22 

S23 

S23 

S29 

S23 

S40 

S30 

S4 

S23 

S23 

S24 

S24 

S24 

S24 

S24 

S24 

S24 

S24 

S24 

S25 

S25 

S26 

S25 

S25 

S25 

S25 

S25 

S25 

S26 

S26 

S30 

S26 

S26 

S33 

S5 

S12 

S26 

S27 

S27 

S27 

S27 

S27 

S27 

SI 

S37 

Sll 

S28 

S28 

S28 

S28 

S28 

S16 

S28 

S27 

SIS 

S29 

S40 

S39 

S40 

S6 

S29 

S29 

S27 

S29 

S30 

S30 

S12 

S30 

S12 

S30 

S5 

S30 

S30 

S31 

S31 

S38 

S31 

S38 

S30 

S13 

S37 

S38 

S32 

S32 

S32 

S32 

S32 

S16 

S16 

S32 

S32 

S33 

S33 

S33 

S33 

S15 

S33 

S33 

S33 

S33 

S34 

S34 

S34 

S34 

S34 

S34 

S34 

S34 

S34 

S35 

S35 

S35 

S35 

S24 

S5 

S40 

S14 

S35 

S36 

S40 

S14 

S36 

S7 

S36 

S40 

S6 

S36 

S37 

S22 

S37 

S37 

S22 

S37 

S21 

S14 

S37 

S38 

S38 

S38 

S38 

S40 

S38 

S38 

S38 

S38 

S39 

S39 

S39 

S39 

S39 

S39 

S39 

S38 

S39 

S40 

S40 

S13 

S40 

S40 

S8 

S40 

S8 

S40 


The proposed recognition engine is also verified on 
different seven data sets available. The improvement in face 
recognition rate for all these data sets is listed in table. 

The performance metrics for different algorithms shown 
below are with ORL database. All the performance curves 
show that irrespective of the algorithm, by changing the way 
of igniting the recognition engine, the performance of the 
algorithm is optimized. The proposed algorithm performed 
well and is shown in all the performance curves in blue lines. 

Performance curves of PCA Algorithm 
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(a) 


Principle Compoenent Analysis ROC Curs es 



(b) 


Fig. 28. (a) CMC and (b) ROC Curves of PC A Algorithm for Face, Mouth, 
Eye pair, right eye, left eye, nose and head data sets. The proposed algorithm 
is shown in blue. 


Performance curves of LDA Algorithm 


Linear Discriminant Analysis CIVIC Curves 



Rank 

Linear Discriminant Analysis ROC Curses 



(a) 


(b) 

Fig. 29. (a) CMC and (b) ROC Curves of LDA Algorithm for Face, Mouth, 
Eye pair, right eye, left eye, nose and head data sets. The proposed algorithm 
is shown in blue. 


VI. Conclusions 

In this paper, we have formulated a face partitioned 
algorithm and the decision making algorithm. A new and 
powerful way of igniting the recognition engine is introduced. 
This technique is verified on 8 different datasets. This 
partition based ignition outperforms other face recognition 
algorithms. Here, the face is divided into seven different face 
parts. Some of the face parts are redundant like right eye, left 
eye and eye pair for example. This redundancy is purposefully 
included in order to face the challenges like pose and 
illumination variations. Instead of dividing the faces into face 
parts, the Eigen faces can be divided without any redundancy 
compromising the pose and illumination changes. Therefore, 
one of our future works will be developing more efficient way 
of igniting the recognition engine by dividing the eigen faces 
in to either 4 or 9 or 16 parts by dividing the face image in to 


2X2 or 3X3 or 4X4 matrices. 
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TABLE III 

Different Datasets and their total number of images and persons 


Data base 

Total number 
of persons 

Pose, 

Illumination 
and facial 
expression 
variations 

Total 
number 
of face 
images 

Color FERET 
[7] - [8] 

1199 

individuals 

365 

duplicates 

9 

14126 

Yale Database 
[9] 

15 

11 

165 

Yale Face 
Database ‘B’ 
[10] 

10 

64 illumination 
9 poses 

5760 

BioID [12] — 
[13] 

23 

60-70 

1521 

Georgia Tech 
[14] 

50 

15 

750 

FEI [15] 

2000 

14 

17000 

Labeled faces 
in the wild 
[16] -[17] 

5749 

1-20 

13233 


TABLE IV 

Different Datasets and their total number of images and persons 
used on PC A Algorithm 


Database 

Total number of people considered 

Total number of faces per person 

Faces considered for testing 

Faces considered for training 

Face recognition 
rate (in %) 

PCA Algorithm 

Proposed Algorithm 

Color FERET 

40 

9 

8 

1 

61.0 

70.26 

Yale Database 

15 

11 

9 

2 

88.26 

91.02 

Yale Face 
Database ‘B’ 

10 

10 

8 

2 

80.01 

85.56 

BioID 

20 

20 

16 

4 

66.36 

69.58 

Georgia Tech 

50 

15 

13 

2 

81.63 

91.02 
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FEI 

50 

14 

12 

2 

77.89 

80.19 

Labeled faces 
in the wild 

40 

10 

8 

2 

61.0 

68.5 


TABLE V 

Different Datasets and their total number of images and persons 
used in LD A Algorithm 


Database 

Total number of people considered 

Total number of faces per person 

Faces considered for testing 

Faces considered for training 

Face recognition 
rate (in %) 

LDA Algorithm 

Proposed Algorithm 

Color FERET 

40 

9 

8 

1 

63.11 

68.19 

Yale Database 

15 

11 

9 

2 

85.69 

86.59 

Yale Face 
Database ‘B’ 

10 

10 

8 

2 

81.16 

83.69 

BioID 

20 

20 

16 

4 

69.68 

75.16 

Georgia Tech 

50 

15 

13 

2 

80.05 

89.16 

FEI 

50 

14 

12 

2 

81.15 

84.13 

Labeled faces 
in the wild 

40 

10 

8 

2 

59.16 

64.59 


In case of testing images taken are more than one, then the 
face recognition rate is calculated by taking the average of the 
face recognition rates of all the testing images. 
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Abstract — Software as a service (SaaS) is a Cloud Computing 
service model that exploits economies of scale for SaaS service 
providers by offering a single configurable software and 
computing environment for multiple tenants. This contemporary 
multi-tenant service requires a multi-tenant database that 
accommodates data for multiple tenants using a single database 
schema. In general, traditional Relational Database Management 
Systems (RDBMS) do not support multi-tenancy and require 
schema extensions to provide multi-tenant capabilities. This paper 
proposes a multi-tenant database schema called Elastic Extension 
Tables (EET), which is highly flexible in enabling the creation of 
database schemas for multiple tenants by extending a preexisting 
business domain database, or by creating tenant business domain 
database from the scratch at runtime. The empirical results 
presented in this paper indicate that the EET schema has potential 
to be used for implementing multi-tenant databases for multi- 
tenant SaaS applications. 

Index Terms — Cloud Computing, Software as a Service, Multi- 
tenancy, Elastic Extension Tables, Multi- tenant Database. 


I. Introduction 

C LOUD Computing has recently emerged as a new 

computing paradigm that transforms the IT industry, 
making the computing software and hardware more appealing 
to use as a service over the internet [17], [26]. This new 

computing paradigm has been gaining popularity for two 
reasons. First, the internet has become affordable and its speed 
has significantly increased [29]. Second, rapid growth in 
computer usage, in areas such as businesses, governments, 
health services, education, social media networks, mobile 

applications, and other computational aspects [17]. This 
increase in internet speed and the computer usage resulted in 
the need to maximize the use of computational resources and to 
minimize the cost. Cloud Computing offers a solution to this 


need by moving applications and their data from desktop and 
portable Personal Computers into large data centers [16]. Cloud 
Computing is rapidly evolving, with the prospects that it will be 
one day the fifth used utility after water, electricity, gasoline, 
and telephone [5], [15], [19]. Cloud Computing includes a 
number of service delivery models such as Infrastructure as a 
Service (IaaS), Platform as a Service (PaaS), and Software as a 
Service (SaaS) [16], [18], [24], [27]. Multi-tenancy is a 
fundamental characteristic of Cloud Computing services that 
allows SaaS vendors to run a single application that support 
multiple tenants using the same software and hardware 
infrastructure [13], [25], [28]. It is a common practice in SaaS 
applications to use a multi-tenant database architecture with a 
single database schema shared among all tenants [4], [21]. 
Cloud database service providers regard such a database as an 
effective resource sharing storage as it reduces the costs by co- 
locating multiple tenants’ databases into a single database 
schema. It also reduces the total cost of ownership of the 
service. Such data architecture consists of two types of data: 
shared data and tenant’s private isolated data. Combining these 
two types of data provides tenants with a complete view of data 
that fits their business requirements [7], [9]. 

Most modern Relational Database Management Systems 
(RDBMS) have been designed to manage data for a single 
tenant. However, single-tenant databases do not support the 
unique requirements of individual tenants and this can lead to 
incorrect assumptions and query plans [1], [21]. Various multi- 
tenant database schema techniques have been studied and 
implemented to overcome this challenge, including Private 
Tables, Extension Tables, Universal Table, Pivot Tables, 
Chunk Table, Chunk Folding, and XML Table [2], [8], [12], 
[14], [22], [21]. These multi-tenant schema techniques are 
based on traditional RDBMS [4], [7]. However, these multi- 
tenant schema techniques suffer from various limitations that 
still need to be addressed [5], [11], [21], [23], and overcoming 
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these limitations in the context of SaaS applications has 
received a lot of attention, both from academic and industry- 
based researchers. 

In this paper, we propose a novel multi-tenant database 
schema called Elastic Extension Tables (EET) that consists of 
Common Tenant Tables (CTT), Extension Tables (ET), and 
Virtual Extension Tables (VET). This multi -tenant schema 
enables tenants to build their own virtual database schema by 
creating the required number of tables and columns, creating 
virtual database relationships, and assigning suitable data types 
and constraints for table columns during multi -tenant 
application run-time execution. It also gives tenants the 
opportunity to address their individual business requirements 
by choosing from three database models: (1) Multi-tenant 
Relational Database, (2) Integrated Multi-tenant Relational 
Database with Virtual Relational Database, and (3) Virtual 
Relational Database. In addition, it allows tenants to store 
different data types, including structured, semi-structured, and 
unstructured data. In this paper, several experiments are 
performed to evaluate the feasibility and effectiveness of EET 
multi-tenant database schema by comparing it with Universal 
Table Schema Mapping (UTSM) [2], which is commercially 
used by Salesforce. Significant performance improvements 
obtained using EET when compared to UTSM, makes the EET 
schema a good candidate for implementing multi -tenant 
databases and multi-tenant applications. 

The rest of the paper is organized as follows: section 2 
discusses the related work of multi-tenant database schema 
designs. Section 3 proposes the Elastic Extension Tables multi- 
tenant database schema. Section 4 proposes three Elastic 
Extension Tables database models. Section 5 presents an 
example to compare other multi-tenant database schema 
designs with the Elastic Extension Table design. Section 6 
presents a set of experiments that compare the performance of 
Elastic Extension Tables with Universal Table Schema 
Mapping. Section 7 concludes this paper and discusses future 
work. 

II. Related Works 

A number of multi-tenant database schema designs and 
techniques have studied and implemented to address multi- 
tenant database challenges. This section presents seven multi- 
tenant database schema techniques, including Private Tables, 
Extension Tables, Universal Table, Pivot Tables, Chunk Table, 
Chunk Folding, and XML Table [2], [8], [12], [14], [22], [23]. 
All of these multi-tenant database schema techniques are based 
on traditional RDBMS [4], [7]. 

A. Private Tables 

The Private Tables technique allows each tenant to have his 
own private tables, which can be extended and changed [22], 
[23]. Using this multi-tenant query technique can be 
transformed from one tenant to another by renaming tables, and 
metadata without using extra columns like ‘tenant_id’ to 
distinguish and isolate the tenants’ data. In contrast, many 
tables are required to satisfy each tenant needs. Therefore, this 
technique is suitable only for a small number of tenants to 


ensure sufficient database load and good performance [23]. 

B. Extension Tables 

The Extension Tables are separated tables joined with the 
base tables by adding tenants’ columns to construct logical 
source tables [22], [23]. This technique adapted from the 
Decomposed Storage Model that splitting up n-columns table 
into n 2-column tables joined using surrogate values [22]. 
Multiple tenants can use the base tables and the extension tables 
[7]. It is regarded as a better design when compared to Private 
Tables described above. Using this design, the number of tables 
grows with the number of tenants, and variety of their different 
business requirements [22] . 

C. Universal Table 

A Universal Table contains a large number of columns that 
enable tenants to store their required columns. It is structured 
with two main columns 'tenant_id' and 'table_id', and other 
generic data columns, which have a flexible VARCHAR data 
type in which different data types with different data values can 
be stored in these columns [2], [22]. A flexible technique that 
enables tenants to extend their tables in different ways 
according to their business needs. However, the rows of the 
universal table can be too wide with an overhead in the number 
of NULL values, which the database has to handle [22]. 

D. Pivot Tables 

In using the Pivot Tables technique, the application maps the 
schema into generic structure in the database, in which each 
column of each row in a logical source table is given its own 
row in the Pivot Table. The rows in the Pivot Table comprise of 
four columns, including tenant, table, column, and row that 
specifies which row in the logical source table they represent. 
It also includes a single data type column that stores the values 
of the logical source table rows according to their data types in 
the designated pivot Table [8], [21]. For example, the Pivot 
Tables can include two pivot tables, the first table 'pivot_inf to 
store INTEGER values, and the second table 'pivot_str' to store 
STRING values. The performance benefits are achieved using 
this technique by avoiding NULL values and by selectively 
reading from smaller numbers of columns. Pivot Tables 
technique, which partitions data vertically performs better when 
it allows selectively read in columns to improve the 
performance, when it compared with others multi-tenant 
database schema techniques that partition data horizontally (e.g. 
Universal Table) [22]. 

E. Chunk Table 

The Chunk Table is another generic structure technique that 
is similar to Pivot Table, except it has a set of data columns with 
a mixture of data types that replace the column 4 col’ in the Pivot 
Table with ‘chunk’ column in the Chunk Table [22]. This 
technique partitions the logical source table into groups of 
columns. Each group is assigned a chunk ID and is mapped into 
an appropriate Chunk Table. This technique has four 
advantages over Pivot Table, including (1) Reducing metadata 
storage ratio, (2) reducing the overhead of reconstructing the 
logical source tables, (3) reducing the number of columns, and 
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(4) providing indexes. This technique is flexible, but it adds 
complexity to database queries [22] . 

F. Chunk Folding 

Chunk Folding is a schema mapping technique that partitions 
logical source tables into chunks vertically [8], [22]. These 
chunks are folded in different physical tables and joined 
together, where a chunk of columns is partitioned into a group 
of columns and each group has a chunk id [8]. Aulbach et al. 

[22] performed experiments to measure the efficiency of Chunk 
Table and Chunk Folding techniques, and they found that 
Chunk Folding technique outperform the Chunk Table 
technique. In addition, they state that the performance of this 
technique is enhanced by mapping the most used tenants’ 
columns of the logical schema into conventional tables, and the 
majority of tenants does not use the remaining columns in the 
Chunk Tables. However, the main limitation and weakness of 
the Chunk Folding technique is that the common schema that is 
used by multiple tenants must be known in advance, which is 
not a practical solution for multi-tenant databases. This issue is 
also present in Extension Tables, Pivot Tables, and Chunk 
Table multi-tenant schema techniques. 

G. XML Table 

The XML Table database extension technique is a 
combination of relational database and Extensible Markup 
Language (XML) [8], [12], [23]. The tenants’ extension 
columns can be provided as native XML data type, or storing 
the XML document in the database as a Character Large Object 
(CLOB) or Binary Large Object (BLOB) [23]. XML data type 
facilitating the creation of database tables, columns, views, 
variables and parameters, and isolating the application from the 
relational data model [12]. This technique satisfies tenants’ 
needs because their data can be handled without changing 
original database relational schema, and XML data type can be 
supported by several relational database products [8], [12]. 
However, this technique reduces the data access performance 

[23] , and Heng et al. [14] state that this technique has the 
poorest performance (e.g. highest response time), when 
compared to Private Tables, Universal Table, Pivot Tables, 
Chunk Table and Chunk Folding techniques. 

Heng et al. [14] conducted a number of experiments to 
evaluate retrieving data from five different multi -tenant 
schemas used in multi-tenant SaaS applications, including 
Private Tables, Universal Table, Pivot Tables, Chunk Table, 
Chunk Folding, and XML Table. The results of these 
experiments show that retrieving data from Universal Table is 
faster than the other schema techniques, except the Private 
Tables schema. Aulbach et al. [23] conducted experiments to 
compare Private Table schema and the Universal Table (Spare 
Columns) schema. The results of these experiments show that 
the Universal Table schema has the same or better performance 
than the Private Tables schema when retrieving or inserting 
data, except when inserting a large amount of data, the 
Universal Table schema is slower than the Private Tables 
schema. Such experimental results lead to conclusion that the 
query performance of Universal Table schema is the best 
performance out of the five multi-tenant schema techniques, as 
the Private Tables schema is only suitable for a small number 
of tenants. Overall, the experimental results make the Universal 


Table schema the optimal schema to use for a multi-tenant 
database when it is compared to Pivot Tables, Chunk Table, 
Chunk Folding, and XML Table. Nevertheless, the Universal 
Table can be too large introducing overhead with the number of 
NULL values, which the database has to handle. This suggests 
that the currently available multi-tenant database schemas still 
have remaining challenges, and represent suboptimal designs. 
Section 5 presents an example that clarifies how the data is 
populated in the seven multi-tenant database schema designs 
that are discussed in this section. 

III. Elastic Extension Tables 

The EET multi-tenant database schema proposes a novel way 
of designing and creating an elastic database that consists of 
three table types, the first type is CTT, the second type is ET, 
and the third type is VET. Fig. 1 shows the details of EET multi- 
tenant schema. The design of this schema enables tenants to 
build their own virtual database schema by creating the required 
number of tables and columns, rows, creating virtual database 
relationships, and assigning suitable data types and constraints 
for table columns during the runtime execution of a multi-tenant 
application. 

A. Common Tenant Tables 

The Common Tenant Tables are the tables that can be shared 
between tenants who are using a multi-tenant single database 
schema. These tables are RDBMS, and are used as a business 
domain database schema that is shared between multiple 
tenants. For example, a multi-tenant application of a sales 
business domain may have a database schema with sales tables, 
such as salesperson, customer, product, sales -fact, and any 
other sales tables. These tables have columns that are used by 
most of the tenants, and the column tenant ID is used to 
differentiate between the tenants’ rows. For example, the 
‘salesperson’ CTT in Fig. 11 shows some common columns, 
such as ‘first name’, and Tast_name’, while the ‘tenant_id’ 
column is used to differentiate between the tenants’ rows. 

B. Extension Tables 

The Extension Tables are metadata tables that are used to 
create virtual tables for multiple tenants who are using a single 
multi-tenant database schema during the application’s runtime 
execution. They consists of the following eight physical tables: 
1) Dbjtable Extension Table 

The ‘db_table’ ET allows tenants to create virtual (logical) 
tables and give them unique names. The structure of this 
table has a composite primary key that consists of 
‘db_table_id’ and ‘tenant_id’ columns. The ‘db_table_id’ 
column is a unique primary key of the table, while the 
‘tenantid’ column is a foreign key refers to the ‘tenant’ 
CTT and at the same time is a combined primary key with 
‘db_table_id’ for this table. In addition, this table has the 
‘db_table_name’ column that stores the virtual tables’ 
names. In using this table, each tenant can have unique table 
names. For example, tenant- A can create a VET name 
‘salesperson’, but cannot create the same VET name again 
for his VETs. However, tenant-B can create the 
‘salesperson’ name even if tenant- A already created this 
VET’s name. 
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2 ) Table _column Extension Table 

The ‘table_column’ ET allows tenants to create virtual 
columns for a VET that created in the ‘db_table’ ET. The 
structure of this table has a composite primary key consists 
of ‘table_column_id’, ‘tenant_id’, and ‘dbtableid’. The 
‘tablecolumnid’ is a unique primary key for this ET, 
while the other two columns ‘tenant_id’ and ‘db_table_id’ 
are primary keys in this table, and foreign keys that refer to 
primary key columns of the ‘tenant’ CTT, and the 
‘dbtable’ ET. Moreover, this table has other columns, 
including ‘table_cohimn_name’, ‘default_value’, 
‘data_type’, ‘is_indexed’, ‘is_null’, ‘is_relationship’, 
4 is_primary_key_column’, and 4 is_unique_column’. The 
‘table_column_name’ column has UNIQUE constraint, and 
VARCHAR data type. The ‘default_value’ column stores 
already defined value to be used once the database saves a 
table row, when there is no value specified to be stored in 
this column. The ‘data type’ column specifies the data type 
of a virtual column that is stored into any of the three row 
ETs, which are presented in the following point. The 
‘is_indexed’ column specifies whether a column has an 
index or not. The ‘is_null’ column specifies whether a 
column accepts to store NULL values or not, and if it does 
not, then this column is considered a mandatory column that 
must have a value. The ‘is_relationship’ column specifies 
whether a column has at least one relationship with any of 
the CTTs or the VETs. The ‘is_primary_key_column’ 
column specifies whether the column is a primary key. The 
‘is unique column’ column specifies whether a column has 
a UNIQUE constraint. 

3) The Row Extension Tables 

The row ETs store virtual table rows for virtual extension 
columns in three separate ETs. Such ETs are separated in 
three tables in order to store small data values in the 
‘table_row’ ET, which stores values such as NUMBER, 
DATE-and-TIME, BOOLEAN, VARCHAR and other data 
types. While large data values are stored in other two ETs, 
the first ET is the ‘table_row_blob’ that stores BLOB values 
of virtual columns that stores BLOB data type (e.g. Images, 
Audio, Video), and the second ET is the ‘table_row_clob’ 
that stores CLOB values for virtual columns that store 
TEXT data type (e.g. E-mails, web pages). The EET design 
separates these three ETs to reduce the impact of BLOB and 
CLOB values from slowing down virtual schema queries. 
These three tables have the same columns, except the table 
row ID column, which is called differently in the three 
tables. In the ‘table_row’ ET called ‘table_row_id’, in the 
‘table_row_blob’ ET called ‘table_row_blob_id’, and in the 
‘table_row_clob’ called ‘table_row_clob_id’. A table row 
ID can be given for several columns that map to one row in 
a VET. Fig. 14 shows an example of this mapping. The 
corresponding columns in these three tables include, first, 
the ‘serial_id’ column which is a composite primary key in 
these tables. This column stores a serial number of a virtual 
column that maps to a row in the virtual table. Second, the 
foreign key columns, including ‘tenant_id’, ‘db_table_id’, 
and ‘table_column_id’ which at the same time are 
composite primary keys with the Table Row ID column and 
the ‘serial_id’ column. Third, the ‘value’ column that stores 
the virtual column values, however, the data types of these 


columns vary in each of the three row tables according to 
the data types that supposed to be stored in each table. These 
three row ETs are capable to store data types, including 
traditional relational data, texts, audios, images, videos, and 
XML in structured, unstructured, and semi -structured 
format. The structured data, such as traditional relational 
data can be stored in CTTs and VETs as it is presented in 
the EET design in Section 5. The un-structured data files 
such as images, audios, videos can be stored in EET, by 
storing the Uniform Resource Identifier (URI) of a file in 
the ‘table_row_blob’ ET. Then the actual physical file can 
be stored in a folder of a file system, and then this file can 
be accessed using the URI that stored in the 
‘table_row_blob’ ET and mapped to the physical file that 
stored in a folder. The semi -structured data such as XML 
files can be used in two ways. Firstly, using the same 
method as used for storing unstructured data, then accessing 
the XML file using the URI that stored in the 
‘tablerowblob’ ET and mapped to the physical XML file 
that stored in a folder. Secondly, an XML file can be stored 
as text in the ‘table row clob’ ET as a CLOB file, and then 
accessed from the ‘table_row_clob’ ET. It is being argued 
that RDBMSs are not scalable, because they are limited in 
offering good performance and scalability properties. 
Nevertheless, this issue can be resolved by using any of the 
available distributed software products in the market that 
scale and optimize RDBMSs on the cloud, such as MySQL 
Cluster, VoltDB, Clustrix, ScaleDB, NuoDB, ScaleBase 
[20], and many others. 

4 ) Primary Key Extension Table 

The ‘table_primary_key_column’ ET allows tenants to create 
virtual primary keys for the virtual extension columns which 
are stored in the ‘table_column’ ET. The structure of this table 
has a composite primary key consists of 
‘table_primary_key_column_id’, ‘tenant_id’, ‘db_table_id’, 
and ‘table_column_id’. The ‘table_primary_key_column_id’ 
column is a unique primary key of the table, while the other 
three columns ‘tenant_id’, db_table_id’, and ‘table_column_id’ 
are primary keys and foreign keys. The ‘is auto increment’ 
column specifies whether a primary key can be auto- 
incremented or not. The ‘is_composite_key’ column is used to 
specify whether a virtual primary key that is stored in a table is 
a single primary key or a composite primary key. 

5 ) Relationship Extension Table 

The ‘table_relationship’ ET allows tenants to create virtual 
relationships between their VETs and CTTs. The table structure 
has a composite primary key consists of ‘table_relationship_id’, 
‘tenant_id’, ‘db_table_id’, and ‘table_column_id’. The 
‘table_relationship_id’ column is a unique primary key of the 
table, while the other three columns ‘tenant_id’, ‘db_table_id’, 
and ‘tablecolumnid’ are primary keys and foreign keys. The 
‘table_type’ column specifies whether the relationship is with a 
CTT or a VET. The ‘target table id’ column is used to create a 
master-detail relationship between two VETs, by storing into it 
the table ID of the master VET that is stored in the ‘dbtable’ 
ET, while the ‘targeted_column_id’ column is used to store into 
it the primary key ID of the master VET for the same 
relationship. The ‘shared_table_name’ column is used to create 
a master-detail relationship between a CTT and a VET, by 
storing into it the name of the master CTT while the name of 
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the 4 shared_cohimn_name’ column is used to store the primary 
key column name of the CTT for the same relationship. 
Furthermore, this ET can create a master-detail relationship 
between two VETs, or a CTT and a VET, even if the master 
table has composite primary keys. Such a relationship can be 
achieved by storing multiple table rows into the 
‘tablerelationship’ for the relationship that is between the 
master table that has a composite primary key, and the details 
VET. Each of these table rows denotes one of the primary key 
columns of the composite primary key that relates to the master 
table. The following are the database relationships that can be 
created using the 4 table_relationship’ ET between two VETs, 
two CTTs, or one VET and one CTT, including One-to-One, 
One-to-Many, Many-to-One, Many-to-Many, and Self- 
referencing. 

6) Index Extension Table 

The 4 table_index’ ET is used to add indexes for virtual columns 
of a VET to improve and speed up the query execution time 
when retrieve data from this VET. The structure of this table 
has a composite primary key consists of 4 table_row_id’, 
4 serial_id’, ‘tenantid’, 4 db_table_id’, and ‘tablecolumnid’. 
The column 4 table_row_id’ and 4 serial_id’ are unique primary 
keys that are referred to values stored into 4 table_row_id’ and 
4 serial_id’ columns in the 4 table_row’ ET. While the other three 
columns 4 tenant_id’, 4 db_table_id’ and 4 table_column_id’ are 
primary keys and foreign keys for this table. The ‘value’ 
column stores a value that is stored in the ‘tablerow’ ET and 
this value relates to an indexed virtual column, which is 
specified as an index in the ‘tablecolumn’ ET by storing the 
necessary value in the 4 is_indexed’ column. 


Extension Tables Common Tables 



Fig. 1 Elastic Extension Tables. 


C. Virtual Extension Tables 

Virtual Extension Tables are the tables that tenants can create 
during the application’s runtime execution to extend an existing 
business domain database schema, or they can create their own 
virtual database schema from the scratch to fulfil their business 
needs. In Section 5, a detailed example is presented to explain 
how the tenants can create their VETs. In EET, VETs are 
created as a metadata into the eight ETs. In using this approach, 
the service provider who is offering a business domain 
database, can accommodate a large number of virtual tables by 
allowing tenants to populate these eight ETs with their data. 
This approach allows multi-tenant database service providers to 
manage their services in an efficient and cost-effective manner, 
and at the same time, it allows each tenant to configure its 
database schema according to its requirements. 

IV. Elastic Extension Tables Database Models 

The EET multi-tenant database schema allows the service 
provider to offer his tenants with the choice of using any of 
the following three database models (Fig. 2): 



A. Multi-tenant relational database 

This database model allows tenants to use a standard 
relational database schema for a particular business domain 
database without the need to extend the existing database 
structures. This business domain database, can be shared 
between multiple tenants and differentiate between them by 
using a Tenant ID column in the CTTs (physical tables). This 
model can be applied to any business domain database such as 
Customer Relationship Management (CRM), Accounting, 
Human Resources (HR), or other business domains. 

B. Integrated multi-tenant relational database with virtual 
relational database 

This database model allows tenants to use a standard 
relational database schema for a particular business domain, 
extend it by adding additional virtual database tables, and 
combine these tables with the existing database structure by 
creating virtual relationships between them. 

C. Multi-tenant virtual relational database 

This database model allows tenants to create their virtual 
database schema from the scratch, by creating VETs, virtual 
database relationships between the VETs, and other database 
constraints to satisfy the tenants’ special business requirements 
of the tenants’ business domain applications. 
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For example, if a service provider offers a sales database 
schema to be used by multiple tenants, and with this database 
schema the service provider uses the EET, then this service 
provider can offer the three database models listed above that 
fulfil various business requirements. This example assumes that 
the service provider has three tenants. 


(a) 


Multi-tenant Relational Database Model 


Sales Database Schema 





Tenant - 1 

ilfr 




Fig. 3 The EET Three Database Models Example. 


three database models allow tenants to design their databases 
and automatically configure their behaviors during their 
application’s runtime execution. 

V. An Example to Compare Multi-tenant Database 
Schema Designs With Elastic Extension Tables 

This section presents an example that clarifies the seven 
multi-tenant database schema designs that presented in the 
related work section, and clarifies the differences between these 
designs and the EET multi-tenant schema design. This example 
shows three different tenants, including Tenant- A, Tenant-B, 
and Tenant-C. Each of these tenants uses a multi-tenant 
database, and in this database, they configure their sales 
database structure according to their different business needs. 
For simplicity, this example illustrates only one sales table that 
stores a sales person’s information by using different multi- 
tenant database schema designs. Moreover, this example 
presents how the EET enables tenants to create their own 
database schema by extending an existing RDBMS database 
schema, including the required number of tables and columns, 
rows, virtual database relationships with any of the CTTs or 
VETs, primary keys for the columns, indexes for the columns, 
and assigning suitable data types for columns during multi- 
tenant application runtime execution. In order to show the 
difference between the table structures and how database is 
populated we use the same data across all the designs in this 
example. 

The Private Tables in Fig. 4 show three tenants each of them 
with different Sales Person table that fulfil their business 
requirements. Tenant- A has the ‘sales_person_tenant_a’ table, 
which consists of six columns, including ‘sales_person_id’, 
‘first_name’, Tast_name’, ‘phone’, ‘age’, and ‘gender’. 
Tenant-B has the ‘sales_person_tenant_b’ table, which consists 
of four columns, including ‘sales_person_id’, ‘first_name’, 
Tast_name’, and ‘business_id’. Tenant-C has the 
‘sales_person_tenant_c’ table; the columns in this table are the 
same as ‘sales_person_tenant_a’ table. The same data that was 
used to populate the private table was used to populate the rest 
of the multi-tenant database schema designs and EET schema, 
which are presented in the example of this section. 


The first user evaluated the Sales database, and found that 
this database suits his business requirements without any 
modifications. Therefore, this user will use the Sales database 
schema as originally provided by the service provider as 
illustrated in Fig. 3 (a). The second user has evaluated the Sales 
database schema and found that he needs to add extra tables to 
fulfil his business needs. Thus, this user created VET 1, VET 2, 
and VET 3, and then, created virtual database relationships 
between these VETs and the already existing physical tables 
(CTTs) in the sales database schema. The database model for 
this user is shown in Fig. 3 (b). The third user evaluated the 
same database schema and found that it did not suit his business 
requirements. Therefore, he decided not to use the Sales 
database schema at all, and instead created virtual relational 
tables from scratch and established database relationships 
between them as shown in Fig. 3 (c). This example illustrates 
the three database models of EET multi -tenant schema. These 
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150 

Sam 

Zen 

07123456739 

23 
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Fig. 4 Private Tables. 

The Extension Tables in Fig. 5 show how the columns of the 
Sales Person tables for the three tenants split-up between the 
base table ‘salesperson’ and two extension tables 
‘sales_person_tenant_a_&_c’ and ‘sales_person_tenant_b’. All 
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of these three tables have two fixed common columns, 
including ‘tenant_id’ and ‘row’. The ‘tenant_id’ column is used 
to map data rows in the base table and the extension tables with 
the tenant who owns these rows. The ‘row’ column is used to 
give each row in the base table a row number and map it with 
other rows in the extension tables. The ‘salesperson’ base table 
has five columns, including ‘tenant_id’, ‘row’, 
‘sales_person_id’, ‘first_name’, and Tast_name\ All the 
tenants share the last three columns. The extension table 
‘sales_person_tenant_a_&_c’ has five columns, including 
‘tenant_id’, ‘row’, ‘phone’, ‘age’, and ‘gender’. This table is 
shared by two tenants Tenant- A and Tenant-C, due to the 
similarity in the extension columns that both tenants need. The 
‘sales_person_tenant_b’ is used by Tenant-B, which has three 
columns ‘tenant_id’, ‘row’, and ‘business_id’. 


sales _person 


tenant ill 

row 

sales person ill 

Ill’s t name 

last name 

1 

0 

100 

Joseph 

Richard 

1 

1 

101 

Sarah 

Smith 

2 ~1 

0 

200 

David 

John 

3 

0 

150 

Sam 

Zen 


Base table 


sales person tenant a 

* c 1 



tenant ill 

row 

plume 

age 

gemler 

1 

0 

02123456789 

25 

male 

1 

1 

02123456788 

34 

female 

3 

0 

07123456789 

28 

male 


Teiiant-a t£ c 


sales person tenant b 


tenant ill 

row 

business ill 

2 

0 

123456 


Tenantb 

Fig. 5 Extension Tables. 


The Universal Table in Fig. 6 shows how the tenants’ data 
are stored in the universal table. This table has a number of 
columns, including ‘tenant id’, ‘table id’, and ‘col_l’ until 
‘coi n’. The ‘tenant_id’ column is used to map rows with their 
tenants. The ‘table_id’ column is used to map rows to a 
particular table. The columns, including ‘col 1’ until ‘coi n’ 
are the universal columns that store any data the tenants wish to 
store to fulfil their business requirements. 
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Fig. 6 Universal Table. 


The Pivot Tables in Fig. 7 show how the tenants’ data with a 
specific data type is stored in a specific pivot table. In this 
example, we have two pivot tables, the first table is ‘pivot_int’ 
that stores INTEGER data values, and the second table is 
‘pivot_str’ that stores STRING data values. Each pivot table has 
standard columns, including ‘tenant_id’, ‘table’, ‘col’, and 
‘row’. In addition to a column that can vary in each pivot table 
according to the data type that is specified for that table. For 
instance, the pivot table that stores STRING values will have a 
column that stores STRING values, and the column name could 
be called ‘str’. The ‘tenant id’ column is used to map each row 


in a pivot table with a tenant. The ‘table’ column is used to map 
a data type value to a particular table. The ‘col’ column is used 
to map a data type value to a particular column in a particular 
table. The ‘row’ column is used to map a data type value to a 
particular row in a particular table. 


The Chunk Table in Fig. 8 shows how a set of data columns 
with a mixture of data types is structured. The ‘chunk_int_str’ 
table has six columns, including ‘tenant_id’, ‘table’, ‘chunk’, 
‘row’, ‘inti’, and ‘strl’. The ‘tenant id’ column is used to map 
each table row in a chunk table with a tenant. The ‘table’ 
column is used to map a table row to a particular table. The 
‘chunk’ column is used to compound data for more than one 
logical column for a particular table. The ‘row’ column is used 
to map a data value to a particular row in a particular table. The 
‘inti’ column is used to store all the INTEGER data values for 
different columns of different tables. The ‘strl’ column is used 
to store all the STRING data values for different columns of 
different tables. 
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Fig. 8 Chunk Table. 


The Chunk Folding tables in Fig. 9 show how the most 
commonly used tenants’ columns are structured in the 
‘account row’ table, while the remaining columns are 
structured into Chunk Folding table called ‘chunkrow’. The 
remaining columns that are used by tenants have extra business 
requirements, which are not applied in the common columns in 
the ‘account row’ table. The ‘tenant_id’ column in both tables 
is used to map each table row with a tenant. The ‘row’ column 
in both tables is used to map a data value in a particular row of 
a particular table. The table ‘account_row’ consists of five 
columns, including ‘tenant_id’, ‘row’, ‘sales_person_id’, 
‘first_name’, and Tastname’. The last three columns in this 
table are the common columns that are shared by the three 
tenants (Tenant- A, Tenant-B, and Tenant-C). The ‘chunk row’ 
table consists of six columns, including ‘tenant_id’, ‘table’, 
‘chunk’, ‘row’, ‘inti’, and ‘strl’. The ‘table’ column is used to 
map a row to a particular table. The ‘chunk’ column is used to 
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Fig. 7 Pivot Tables. 
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combine data for more than one column for a particular table. 
The ‘inti’ column is used to store all the INTEGER data values 
for different columns of different tables. The ‘strl’ column is 
used to store all the STRING data values for different columns 
of different tables. 
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Fig. 9 Chunk Folding. 


The XML Table in Fig. 10 shows how this technique 
combines RDBMS and XML, by having fixed columns shared 
by all tenants, including ‘tenant_id’, ‘salesperson_id’, 
‘firstpame’, Tast_name\ The ‘tenantid’ column is used to 
map each table row in the ‘accountrow’ table with a tenant. 
The rest of the columns are Sales Person columns that are 
shared by all tenants. The fifth column is ‘extxml’, this column 
is used to store an XML structure includes the rest of the logical 
columns that tenants may need to fulfil their extra business 
needs. For instance, as shown in the first table row in the 
‘account_row’ table, there are three values stored using XML 
structure in the ‘ext_xml’ column, including phone, age, and 
gender. 
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Fig. 10 XML Table 


Fig. 11 shows an example of the EET, which have three 
VETs that were created using the ETs. These three VETs are 
the tenants’ tables that presented in the Private Tables in Fig. 4. 
In this example, the ‘salesperson’ table is a CTT shared by all 
the three tenants and has predefined columns that are commonly 
used by these tenants. The Tenant-A has a business requirement 
to have a Sales Person table that includes the columns that 
predefined in the ‘salesperson’ CTT, in addition to three extra 
columns, including ‘phone’, ‘age’, and ‘gender’. This business 
requirement can be fulfilled by creating the 
‘salesperson_tenant_a’ VET, and adding to this table these 
extra three columns. In addition to, adding the 
‘salesperson id’ column that is a virtual foreign key, which 


builds the virtual relationship between ‘salesperson_tenant_a’ 
VET and the ‘salesperson’ CTT. The Tenant-B has a business 
requirement to have a Sales Person table that includes the 
columns that are predefined in the ‘salesperson’ CTT, in 
addition to the ‘business_id’ column as an extra column to the 
CTT. This business requirement can be fulfilled for this tenant 
by creating the ‘salesperson_tenant_b’ VET, in addition, 
adding the ‘salesperson id’ column that is a virtual foreign 
key, which builds the virtual relationship between 
‘salesperson_tenant_b’ VET and the ‘salesperson’ CTT. The 
Tenant-C has a business requirement the same as the business 
requirement of Tenant-A. Therefore, the 
‘salesperson_tenantp’ VET of the Tennant-C has a similar 
structure and relationship of the ‘salesperson_tenant_a’ VET. 
The shared columns of the ‘salesperson’ CTT store the three 
tenants’ data, while the rest of the tenants’ data is stored in 
VETs by using the ETs, including ‘dbtable’, ‘tablepolumn’, 
‘table_row’, ‘tablerelationship’, table_index’, and 
‘tableprimary_keypolumn’. The details of this data are 
shown in Fig. 12 - 18. 


Virtual Extension Tables Common Table 



Fig. 1 1 Virtual Extension Tables (VET). 
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Fig. 12 The data stored in the ‘ salesperson ’ CTT 
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Fig. 13 The data stored in the ‘db table’ ET 
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Fig. 14 The data stored in the ‘table column’ ET. 
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Fig. 15 The data stored in the ‘table relationship’ ET. 
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Fig. 16 The data stored in the ‘table_primary_key_cohirmT ET. 
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Fig. 17 The data stored in the ‘table row’ ET. 
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Fig. 18 The data stored in the ‘table index’ ET. 


VI. Performance Evaluation 

In this section, we compare the performance of accessing 
data from EET and Universal Table Schema Mapping (UTSM) 
[2]. In EET, data is partitioned vertically, when in UTSM data 
is partitioned horizontally. Liao et al. [2] state that the data 
architecture of UTSM is similar to Salesforce data architecture, 
which originated from the Universal Relations [6] . In addition, 
a number of database query examples presented in [2], [3], and 


used to retrieve data from this data architecture. Some of these 
queries are used in the experiments in this paper, in addition to 
other queries that are used to show the difference in accessing 
data from EET and UTSM. The UTSM technique had to be 
chosen to compare it with EET technique, because as discussed 
and concluded in the related work section, the Universal Table 
that is used in UTSM, is considered as the optimal schema 
design for multi-tenant applications. Moreover, this is one of 
the multi-tenant database schema techniques implemented 
commercially by Salesforce. The data architecture of UTSM is 
shown in Fig. 19. The ‘Data’ table is the universal table that 
stores all tenants’ data, and it has fixed number of data columns. 
The number of columns of this table should be large to 
accommodate the number of columns required by different 
tenants (e.g. Salesforce uses 500 columns for this table). These 
columns store data that maps to objects and fields created in the 
‘Objects’ and ‘Fields’ tables. The data type of these columns is 
VARCHAR, which allows the storage of different data types 
(STRING, NUMBER, DATE, etc.). The ‘Objects’, ‘Fields’, 
and ‘Relationships’ tables are used to construct virtual tables 
and their virtual columns, and build relationships between these 
virtual tables. Whereas the ‘Index’ and ‘Unique fields’ tables are 
used to optimize the query execution time of retrieving data 
from the ‘Data’ universal table [1], [2]. 

In this performance evaluation, the focus is on comparing the 
performance of accessing data from EET and UTSM directly 
from the database level, irrespective of the software solution 
built on top of these two multi-tenant database schemas for two 
reasons: (1) The most significant challenge in multi -tenant 
applications is designing multi-tenant database schema that 
improves multi-tenant query processing. This schema design 
influences the software design built on top of the schema and 
its performance. (2) Comparing the performance of two multi - 
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tenant software solutions under the same conditions, and using 
the same hardware resources is difficult, in particular as some 
software may not be available to be installed on the same 
application server. 


Objects Data Index 



Fig. 19 Universal Table Schema Mapping [2]. 


A. Experimental Data Set and Setup 

Typically, multi-tenant databases store massive data volumes 
across multiple servers to optimize the performance of data 
retrieval. However, before considering scale-up or scale-out for 
multi-tenant databases to optimize its performance, we believe 
that we should perform a comparison between EET and UTSM 
using a single server instance. In order to test the effectiveness 
of accessing data from these two multi-tenant database 
architecture designs without affecting their performance by 
using any scalability. In our experiments, we focus on 
benchmarking the performance of the main tables of both data 
architectures where most of the tenants’ data is stored, and we 
disregard the lookup queries. For example, in EET, we discard 
the queries which check whether a virtual column is indexed or 
not from the ‘table_column’ ET. On the other hand, we 
disregard the queries which check whether a column is indexed 
or not from the ‘fields’ table of UTSM. In this case, our focus 
in EET is on ‘table_row’, and ‘table_index’ ETs, and in UTSM 
is on ‘Data’, ‘Index’, and ‘Uniquefields’ tables. Furthermore, in 
order to run comparative experiments, exactly the same data 
was populated in the ‘table_row’, and ‘table index’ ETs of EET 
in a separate database, and the ‘Data’, ‘Index’, and 
‘Uniquefields’ tables of UTSM in another database. No indexes 
were used other than the default indexes of each schema, which 
are the primary keys and the foreign keys indexes that are 
automatically generated in the RDBMS once the primary key 
and foreign key constraints are specified. The number of virtual 
rows that were already populated in ‘table_row’ ET is 200,000 
rows and the same number of rows in the ‘Data’ universal table. 
These rows belong to the ‘product’ virtual table, and the 
structure of this table in EET and UTSM is shown in Fig. 20. 
There was no data populated in these two databases other than 
the populated 200,000 rows. 

In the multi-tenant database, each tenant’s data is isolated in 
a table partition. Therefore, the experiments are per-formed for 
one tenant to evaluate the effectiveness of retrieving data for 


each single tenant from the multi-tenant database. These 
experiments are divided into four types that are sharing the 
details of this data set. Each query of these experiments is 
performed ten times, and the average execution time of these 
queries is shown in Fig. 21-28. The queries that are related to 
EET and UTSM are shown in Table 1. The inputs and the 
outputs of EET and UTSM queries are the same. However, the 
structures of these queries are different because the data 
architectures of the two schemas are different. The four 
experiments details are listed below: 

The value of the ta0le_r0wjd of 

the tabie_row extension fable 


The value of the object Id 
of the UTSM Data table 

These are the columns' 
names of the virtual product 
table for EET and UTSM. The 
columns' names of EET are 
stored in table_column 
extension table, whereas the 
columns' names of UTSM are 
specified in the spear value 
columns of the Data table. 


Fig. 20. The virtual ‘product’ table structure. 

1 ) Retrieving Rows Experiment ( Exp. 1 ) 

The aim of this experiment is to benchmark the query execution 
time of retrieving rows from EET and UTSM. This experiment 
is divided into four experiments including: 

Retrieving Rows without Using Query Columns Filters 
Experiment (Exp. 1.1): In this experiment, Query 1 (Ql) and 
Query 2 (Q2) are executed. The Ql retrieves rows from the 
‘table_row’ ET of EET without specifying any query filters 
other than the tenant ID, and the ‘project’ table ID. Whereas the 
Q2 retrieves rows from the ‘Data’ universal table without 
specifying any query filters other than the tenant ID and the 
‘project’ object ID. In this study, eight tests using these two 
queries are performed to retrieve 1, 10, 50, 100, 500, 1000, 
1500, and 2000 rows. 

Retrieving Rows Using Columns Query Filters Experiment 
(Exp. 1.2): In this experiment, Query 3 (Q3) is executed on the 
‘table_row’ ET of EET and Query 4 (Q4) is executed on the 
‘Data’ universal table. Both queries are filtered by specifying 
particular numbers of product IDs stored in the ‘product’ virtual 
table. In this study, three tests using these two queries are 
performed to retrieve rows by specifying 1 product ID for the 
first test, 10 product IDs for the second test, and 50 product IDs 
for the third test. The structure of Q4 has presented in [3], but 
with different value settings. 

Retrieving Rows Using Primary Key Indexes Experiment 
(Exp. 1.3): In this experiment, Query 5 (Q5) is executed on the 
‘table_row’ and ‘table_index’ ETs of EET and Query 6 (Q6) is 
executed on the ‘Data’ and ‘Uniquefields’ tables of UTSM. In 
this experiment, a primary key index is used to retrieve rows 
from the ‘product’ virtual table from the ‘table_row’ ET and 
from the ‘Data’ table. In this study, three tests using these two 
queries are performed to retrieve 1, 10, and 50 rows. The 
structure of Q6 has presented in [2], but with different value 
settings. 
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Retrieving Rows Using Custom Index Experiment (Exp. 1 .4): In 
this experiment, Query 7 (Q7) is executed on the ‘table_row’ 
and ‘table_index’ ETs of EET and Query 8 (Q8) is executed on 
the ‘Data’ and ‘Index’ tables of UTSM. In this experiment, a 
custom index is used, which is a selective filter in the tenant’s 
query. This index should be other than the primary key and 
foreign key indexes. This custom index retrieves rows from the 
‘product’ virtual table for both ‘table row’ and ‘Data’ tables. 
The ‘standard_cost’ virtual column is chosen to filter the 
queries by looking up for all the products, which have a 
standard cost greater or equal ‘$ 9000’ from the ‘product’ 
virtual table. In this study, four tests using these two queries are 
performed to retrieve 1, 10, 50, and 100 rows. 

2 ) Inserting Rows Experiment ( Exp. 2 ) 

The aim of this experiment is to benchmark the query execution 
time of inserting rows into EET and UTSM. Query 9 (Q9) is 
executed on the ‘table_row’ and ‘table_index’ ETs of EET and 
Query 10 (Q10) is executed on the ‘Data’, ‘Index’, and 
‘Unique fields’ tables of UTSM. In this study, four tests using 
these two queries are performed to insert 1, 10, 50, and 100 
rows. 

3 ) Updating Rows Experiment ( Exp. 3 ) 

The aim of this experiment is to benchmark the query execution 
time of updating rows into EET and UTSM. Query 11 (Q1 1) is 
executed on the ‘table_row’ and ‘table_index’ ETs of EET and 
Query 12 (Q 12) is executed on the ‘Data’, and ‘Index’ tables of 
UTSM. In this study, four tests using these two queries are 
performed to update 1, 10, 50, and 100 rows. 

4 ) Deleting Rows Experiment ( Exp. 4 ) 

Deleting Rows Experiment (Exp.4): The aim of this experiment 
is to benchmark the query execution time of deleting rows from 
EET and UTSM. Query 13 (Q13) is executed on the 
‘table_row’ and ‘table index’ ETs of EET, and Query 14 (Q14) 
is executed on the ‘Data’, ‘Index’, and ‘Uniquefields’ tables of 
UTSM. In this study, four tests using these two queries are 
performed to delete 1, 10, 50, and 100 rows. 

The experiments were performed on PostgreSQL 8.4 
database, using the default configuration setup. This database 
installed on a PC with 64-bit Windows 7 Home Premium 
operating system, Intel Core i5 2.40GHz CPU, 8 GB RAM 
memory, and 500 GB hard disk storage. 

B. Experimental Result 

This section gives four experimental results as follows: 

1 ) Retrieving Rows 

This experimental result was divided into four results as 
follows. The experimental study of Exp. 1.1 shows that the 
execution time of Q1 that perform on the ‘table_row’ ET of 
EET is approximately 76% faster on average than the execution 
time of Q2 that perform on the ‘Data’ universal table when 1, 
10, 50, 100, 500, 1000, 1500, and 2000 rows were retrieved. 
The details results of this experiment are shown in Fig. 21-22. 
The experimental study of Exp. 1.2 shows that the execution 
time of Q3 that perform on the ‘table_row’ ET of EET is 
approximately 94% faster on average than the execution time 
of Q4 that perform on the ‘Data’ universal table when 1,10, and 
50 rows were retrieved. The details results of this experiment 
are shown in Fig. 23. The experimental study of Exp. 1.3 shows 


TABLE I 

The Experiments Queries 

Query Query Details 

No. 

Q1 SELECT * FROM table_row tr WHERE tr.table_row_id in (SELECT 

distinct(tr2.table_row_id) FROM table_row tr2 where tr2.db_table_id =16 

and tr2.tenant_id - 1000 LIMIT 1 ) ; 

Q2 SELECT * FROM data WHERE tenantid = 1000 and objectld = 1 LIMIT 

1; 

Q3 SELECT * FROM table_row tr WHERE tr.tenantjd =1000 and 
tr.db_table_id =16 and tr.table_column_id IN (50,52,54) and 
tr.table_row_id IN ( SELECT table_row_id FROM table_row tr2 WHERE 
tr2.tenant_id =1000 and tr2.db_table_id = 16 and ( tr2.table_column_id 

=47 and tr2. value = T 63336') ); 

Q4 [3] SELECT price, cost, weight FROM (SELECT valueO AS id, value4 AS 

price , value2 AS cost, value6 AS weight FROM data WHERE objectid = 1 

and tenantid = 1000 ) AS product WHERE id = ' 1 63336' ; 

Q5 SELECT * FROM table_row tr WHERE tr.tenant_id =1000 and 

tr.db_table_id =16 and tr.table_row_id IN (SELECT ti.table_row_id 
FROM table_index ti WHERE ti.tenant_id =1000 and ti.db_table_id =16 

and ti.table_column_id =47 and ti. value = '163337' ) 

Q6 [2] SELECT * FROM data WHERE objectid =1 and tenantid = 1000 and 

dataguid in (SELECT dataguid FROM uniquefields WHERE objectid = 1 

and tenantid = 1000 and num value IN ( 163337) ); 

Q7 SELECT * FROM table_row tr WHERE tr.tenant_id =1000 and 

tr.db_table_id =16 and tr.table_row_id IN (SELECT ti.table_row_id 
FROM table_index ti WHERE ti.tenant_id = 1000 and ti.db_table_id =16 
and ti.table_column_id = 50 and (cast (ti. value as numeric) >= ’9000') 

LIMIT 1); 

Q8 SELECT * FROM data WHERE objectid =1 and tenantid = 1000 and 
dataguid in (SELECT dataguid FROM index WHERE objectid = 1 and 

tenantid = 1000 and fieldNum =3 and numvalue > = 9000 LIMIT 1); 

Q9 INSERT into table_row (table_row_id, serial_id, tenant_id, value, 
db_table_id, table_column_id) values (50000061,1,1000, 

'50000000', 16,47); 

INSERT into table_row (table_row_id, serial_id, tenant_id, value, 
db_table_id, table_column_id) values (50000061,2,1000, '1000', 16,48); 
INSERT into table_row (table_row_id, serial_id, tenant_id, value, 
db_table_id, table_column_id) values (50000061,3,1000, '50000', 16,49); 
INSERT into table_row (table_row_id, serial_id, tenant_id, value, 
db_table_id, table_column_id) values (50000061,4,1000, '222.50', 16,50); 
INSERT into table_row (table_row_id, serial_id, tenant_id, value, 
db_table_id, table_column_id) values (50000061,5,1000, 'Red', 16, 51); 
INSERT into table_row (table_row_id, serial_id, tenant_id, value, 
db_table_id, table_column_id) values (50000061,6,1000, '242.50', 16,52); 
INSERT into table_row (table_row_id, serial_id, tenant_id, value, 
db_table_id, table_column_id) values (50000061,7,1000, '40', 16,53); 
INSERT into table_row (table_row_id, serial_id, tenant_id, value, 
db_table_id, table_column_id) values (50000061,8,1000, '300', 16,54); 
INSERT into table_index (tenant_id, value, table_row_id, serial_id, 
db_table_id, table_column_id ) values (1000, 

'50000000', 5000006 1 , 1 , 1 6,47) ; 

INSERT into table_index (tenant_id, value, table_row_id, serial_id, 

db_table_id, table_column_id ) values (1000, '222.50', 50000061, 4, 16,50); 

Q10 INSERT into data (dataguid, tenantid, objectid ,name, valueO, valuel, 
value2, value3,value4, value5 ,value6) values(50000061, 1000,1, 'name', 
'50000000', '50000', '222.50', 'Red', '242.50', '40', '300'); 

INSERT into uniquefields values (50000061, 1000, 1, 1,", 50000000, '2013- 
12-12'); 

INSERT into index values (50000061, 1000, 1, 3,", '222.50', '2013-12-12'); 

Qll UPDATE table_row set value = '230.50' WHERE tenantjd = 1000 and 
db_table_id =16 and table_column_id = 52 and table_row_id =50000061; 
UPDATE tablejndex set value = '230.50' WHERE tenantjd = 1000 and 

dbjablejd =16 and table_columnJd = 52 and table_rowJd =50000061; 

Q12 UPDATE data set value2 = '230.50' WHERE tenantid = 1000 and 
objectid = 1 and dataguid =50000061; 

UPDATE index set numvalue = 230.50 WHERE tenantid = 1000 and 

objectid = 1 and fieldnum =3 and dataguid =50000061; 

Q13 DELETE from tablejndex WHERE tenantjd = 1000 and db Jable Jd = 

16 and table_rowJd =50000061; 

DELETE from table_row WHERE tenantjd = 1000 and db Jable Jd =16 

and table_rowJd = 50000061; 

Q14 DELETE from index WHERE tenantid = 1000 and objectid = 1 and 
fieldnum =3 and dataguid =50000061; 

DELETE from uniquefields WHERE tenantid = 1000 and objectid = 1 and 
fieldnum =1 and dataguid =50000061; 

DELETE from data WHERE tenantid = 1000 and objectid = 1 and 
dataguid =50000061; 
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Fig. 21 Retrieving small numbers of rows (Exp. 1.1) 
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Fig. 25 Retrieving rows using custom indexes 
(Exp. 1.4) 
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Fig. 27 Updating rows (Exp. 3) 
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Fig. 24 Retrieving rows using PK indexes (Exp. 1.3) 
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Fig. 26 Inserting rows (Exp. 2) 
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Fig. 28 Deleting rows (Exp. 4) 


that the execution time of Q5 that perform on the ‘tablerow’ 
and ‘table_index’ ETs of EET is approximately 88% faster on 
average than the execution time of Q6 that perform on the 
‘Data’ and ‘Unique fields’ tables of UTSM when 1, 10, and 50 
rows were retrieved. The details results of this experiment are 
shown in Fig. 24. The experimental study of Exp. 1.4 shows that 
the execution time of Q7 that perform on the ‘table_row’ and 
‘table_index’ ETs of EET is approximately 60% faster on 
average than the execution time of Q8 that perform on the 
‘Data’ and ‘Index’ tables of UTSM when 1, 10, 50, and 100 
rows were retrieved. The details results of this experiment are 
shown in Fig. 25. 

2 ) Inserting Rows 

The experimental study of Exp. 2 shows that the execution time 
of Q9 that perform on the ‘table_row’ and ‘table_index’ ETs of 
EET is approximately 19% slower on average than the 
execution time of Q10 that perform on the ‘Data’, ‘Index’, and 
‘Unique fields’ tables of UTSM when 1, 10, 50, and 100 rows 


were inserted. The details results of this experiment are shown 
in Fig. 26. 

3) Updating Rows 

The experimental study of Exp. 3 shows that the execution time 
of Qll that perform on the ‘table_row’ and ‘table_index’ ETs 
of EET is approximately 51% faster on average than the 
execution time of Q12 that perform on the ‘Data’, and ‘Index’ 
tables of UTSM when 1, 10, 50, and 100 rows were updated. 
The details results of this experiment are shown in Fig. 27. 

4 ) Deleting Rows 

The experimental study of Exp.4 shows that the execution time 
of Q13 that perform on the ‘table_row’ and ‘table_index’ ETs 
of EET is approximately 32% faster on average than the 
execution time of Q14 that perform on the ‘Data’, ‘Index’, and 
‘Unique fields’ tables of UTSM when 1,10, 50, and 100 rows 
were deleted. The details results of this experiment are shown 
in Fig. 28. 
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I. Conclusion and Future Work 

In this paper, we propose a novel multi-tenant database 
schema design called EET, which consists of CTT, ET, and 
VET. EET allows tenants to create their own virtual database 
schema, including the required number of tables, columns, 
rows, virtual database relationships with CTTs or VETs, and 
assigns suitable data types and constraints for columns during 
the runtime of multi-tenant applications. EET is a single multi- 
tenant database schema that has a flexible way of creating 
database schemas for multiple tenants, by extending a business 
domain database based on RDBMS, or creating tenants 
business domain database from the scratch. EET design 
improves the multi-tenant database performance by avoiding 
NULL values, assigning primary keys to unique columns, 
providing indexes to table columns, and storing BLOB and 
CLOB data types in separate designated tables. In addition, 
EET design allows the storage of different data types, including 
structured, semi-structured, and unstructured data. In this paper, 
we only use structured data for the empirical evaluation, for two 
reasons. First, storing and retrieving data in XML files (semi- 
structured data) has the highest response time among the 
reviewed multi-tenant database schema designs [14], [23]. 
Thus, while semi -structured data can be stored in EET, it is not 
recommended as storage for multiple tenants. Second, there are 
many techniques for storing and retrieving different data types, 
and comparing all of these techniques with EET within the 
scope of a single paper is difficult due to the paper length 
limitations. 

EET approach allows the creation of virtual relationships 
between the tenants’ shared physical tables (CTT) and the 
tenants’ virtual tables (VET), and allows tenants to choose from 
three database models: (1) Multi-tenant Relational Database, 
(2) Integrated Multi-tenant Relational Database with Virtual 
Relational Database, and (3) Virtual Relational Database. 
According to our knowledge, this capability is not included in 
any other multi-tenant database schema design. 

We have compared and evaluated the performance of EET 
and UTSM. The design of EET partitions data vertically to 
avoid storing rows with NULL values. In contrast, the design 
of the Universal Table in UTSM partitions data horizontally, 
which can be associated with significant overheads as a result 
of a large number of NULL values. The experimental study 
reported in this paper shows an improvement when retrieving, 
updating and deleting data from EET over the UTSM. In 
particular, the experiments of retrieving data from EET indicate 
better performance when compared to UTSM. The execution 
time for inserting rows into EET is slightly longer than for 
inserting rows into UTSM. Overall, this experimental study 
makes the EET schema a good candidate for implementing 
multi-tenant databases and multi-tenant SaaS applications. As 
discussed in the related work section, the Universal Table used 
in UTSM is widely accepted as an optimal schema design for 
multi-tenant applications. Therefore, this study measured the 
feasibility and effectiveness of EET by comparing it with 
UTSM. Comparing EET with other existing multi-tenant 
database schema designs that are based on RDBMS and other 
data storage models will be considered in our future research. 
Furthermore, in our future research, we will evaluate the 


performance of EET using multiple tenants and focusing on the 
scalability of the EET approach. 
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Abstract — The availability of network services are 
being menaced by the increasing number of Denial- 
of -Service (DoS) attacks. The availability of such 
interconnected systems is severally degraded by 
increasing number of DOS attacks. Denial-of- 
Service (DoS) attacks cause serious impact on these 
computing systems such as router , host or entire 
network. DoS attack detected using Multivariate 
Correlation Analysis (MCA) technique. Multivariate 
correlation analysis employs for accurate network 
traffic characterization by extracting the 
geometrical correlations between network traffic 
features. The proposed system uses the Multivariate 
Correlation Analysis (MCA) technique for accurate 
characterization also uses the anomaly based 
detection technique in attack recognition . Anomaly 
based detection makes system capable of detecting 
seen and unseen attacks. Moreover , a triangle area 
based technique is planned to reinforce and 
increases performance of MCA. The impact of each 
non-normalized information and normalized 
information on the performance of the proposed 
detection system is tested. 

Keywords — Denial- of- Service attack, network 
traffic characterization, multivariate correlations, 
triangle area. 


i. Introduction 

Most of the existing commercial IDS 

products are signature-based but not 
adaptive or self learning. A common 
methodology employed in DENIAL-OF- 
SERVICE (DoS) attacks is to deluge the 
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Department of Computer Engineering 
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system with a flood of useless packets that 
leading to online system crashes. 

DoS attacks have emerged as a sort 
of network intrusive behaviors and have 
exhibit serious threats to the infrastructures 
of computer networks and numerous 
network-based services. DoS attacks 
severely decrease the provision of a sufferer, 
which might be a host, a router, or a whole 
network. The sufferer can be forced it to 
prevent providing services to alternative 
parties of network from a couple of minutes 
to many days. So, effective mechanism for 
Denial-Of-Service (DOS) attack detection 
are extremely demanded to safeguard 
services running on the sufferer. The till 
principally focus on network primarily 
based attack detection. The system 
supported this mechanism monitor traffic 
passing on the protected network. This 
mechanism deterrent the servers from 
monitoring attack and make sure that the 
servers facilitate with quality services in 
minimum delay while responding. Moreover 
network-based detection systems are 
classified in 2 main approaches. Misuse 
based detection system employs to spot far 
famed intrusions with the help of signature 
of antecedently define rules. Misuse-based 
detection system having low false positive 
rate and provides accuracy in detecting work 
far-famed attacks. This is often not 
applicable just in case of unknown attacks. 
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As signature rules are generated manually, it 
becomes difficult to keep updating and to 
safeguard network security. To overcome 
the drawbacks of misuse based detection 
system in addition to archive novelty 
tolerant detection system and to develop a 
new advance idea like anomaly based 
detection system. Anomaly-based system 
detects known as well as unknown attacks 
by monitoring network behavior. 
Furthermore paper focus on feature 
correlation analysis. In this approach entire 
incoming group of traffic observed as 
legitimate or illegitimate but not individual 
from group of traffic. . To deal with this, 
paper employs approach supported triangle 
area to get an improved normalized feature. 
However approach is depending on 
information of malicious behavior. Denial of 
Service (DOS) attacks are unlimited menace 
to internet websites and among the 
troublesome security hassle in these days 
web. The problem of Denial of service 
attacks has become well known, but it’s 
been troublesome to look the Denial of 
Service on the net. During this MCA based 
detection system to safeguard online servers 
against Dos attack that is based on previous 
work in [16]. In this paper Denial-of service 
(Dos) attack detection system employs the 
triangle area based detection technique and 
anomaly based detection technique. Such 
system provides correct characterization of 
traffic and detection of known and unknown 
intrusions. 


ii. Related Works 

Sharmila Wagh, V. Pachghare, S. Kolhe [1] 
proposed the idea of applying machine 
learning techniques for intrusion detection is 
to automatically build the model based on 
the training data set. This data set contains a 
collection of data instances each of which 
can be described using a set of attributes 


(features) and the associated labels. The 
attributes can be of different types such as 
categorical or continuous. Machine 
Learning Intrusion Detection system has 
been giving high accuracy and good 
detection of novel attacks. V. Paxson [17] 
describes that increasing Internet 
connectivity is good opportunities for 
attackers to get entry in computers over the 
network. The detecting such attacks are 
termed network intrusion detection, 
enormously new area of security research. 
From a security tracking perspective, drops 
can completely defeat the monitoring, since 
the missing packets may contain precisely 
the thrilling site visitors that identify a 
network intruder. Given our first layout 
requirement excessive-speed tracking then 
averting packet clear out drops will become 
another robust requirement. Aruna 
jamdagni, zhiyuan tan, priyadarshani nanda, 
r ping hill [14] proposed the principle 
component analysis technique that employs 
data preprocessing, mahalanobis map for 
extracting features from incoming packets. It 
also propose iterative feature selection 
engine for characteristic choice cause. This 
gadget detects payload based assaults in real 
time. Z.Tan, A. Jamdagni, X. He, P. Nanda 
and R.P.Liu [15] illustrate that the detection 
of dos attack is essential for safety of the 
online offerings services. The DOS assault 
detection mainly specializes in the 
development of the network primarily based 
detection mechanism [3]. The detection 
structures have two strategies specifically 
misuse detection and anomaly detection. 
Misuse detection is used to identify the 
known attacks, using the signatures of 
predefined rules [2]. The relied on profile 
generation is constructing and handed over 
to the assault detection module, which 
compares the tested profile with the normal 
profile. 

Z. Tan, A. Jamdagni, X. He, P. 
Nanda [14] describes Dos attack temporarily 
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prevent services from connecting to the 
internet. The detection, which monitors any 
network activity presenting any significant 
deviation from their normal profiles as a 
suspicious intrusion. It also proposed the 
analysis method that employs information 
preprocessing, mahalanobis map for 
extracting functions from incoming packets. 
It also propose iterative feature selection 
engine for characteristic choice cause. 

This gadget detects payload based assaults 
in real time. It has 3 key capabilities. First, 
for anomaly detection operates on aggregate 
visitors, without glide separation or deep- 
packet inspection. Both of those traits are 
vital for a realistic and deployable anomaly 
detection technique. At the same time as it’s 
far real that the source and destination IP 
addresses of each packet are usually 
available on the routers, port numbers aren’t 
available without glide separation. A few 
previous works makes use of features 
associated with the supply and destination 
port numbers and so will not be capable of 
hit upon anomalies in aggregate or VPN 
tunneled visitors. Notice that working on 
combination visitors is sufficient to detect 
anomalies. Shuyuan Jin, denial so young, 
xizhao wang [11] illustrate that covariance 
matrices employs multiple network attack 
detection. Network based detection system 
categorized in two different approaches 
misuse based detection and anomaly based 
detection. Misuse based detection have low 
false positive rate as well as accurate 
detection of known attacks. This is not 
applicable in case of unknown attack. It uses 
signature of predefined rule. Anomaly 
system detects known as well as unknown 
attacks by monitoring network behavior 
presenting significant deviation from 
legitimate traffic as suspicious object. The 
paper proposes the covariance matrices to 
find out the impact of coherent relations and 
feature depending on multiple attacks. The 
effectiveness of intrusion detection system 


by evaluating percentage of known and 
unknown attack. A covariance matrix keep 
two types of information: first is information 
in group of samples and second is 
correlation information among the observed 
features. Furthermore, the covariance based 
detection employs performance 

improvement by using group of samples in 
the detection and efficiency differentiate 
different classes where mean based 
detection approaches fails. 

p.garcia- teodoro, j. Diaz-Verdejo 
and E. Vazquez [2] puts the adaptive 
security related approaches .anomaly based 
intrusion system protect online or protected 
system against malicious behavior. Intrusion 
detection system architectures is based four 
functional modules: 

1. Analysis boxes: processing modules for 
analyzing and detecting potentials behavior. 

2. Database boxes: This element uses to 
store information from E blocks. 

3. Response boxes: the intention of this type 
of block is the execution. 

The intrusion detection system may be either 
host based or network based. Host mainly 
analyzes event related to OS information 
such as system calls. Intrusion detection 
system is categorized as either signature or 
anomaly based detection system. Signature 
based defined pattern within analyzed 
pattern, within analyzed data. Anomaly 
system detects known as well as unknown 
attacks by monitoring network activities 
presenting deviation from legitimate traffic 
as suspicious object. Chih-fong tsai, chai- 
ying lin [13] described a learning model 
based on triangle area based nearest 
neighbors (TANN) in order to detect attack 
with accuracy. This method illustrate that 
the technique of triangle area based nearest 
neighbors (TANN) by combining 
unsupervised and supervised learning 
technique to detect attacks. The 
classification technique is used as 
component and then clustering technique. 
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Like supervised it is not able to distinguish 
data with accuracy. Hence, initially 
classifier is trained and then it provides 
output. Then this output is given as a input 
to cluster for the purpose of improving 
clustering performance. The proposed 
TANN is composed of 3 steps: 1) clusters 
center extraction, 2) new data formation by 
triangle area and KNN training, 3) training 
and testing based on new data. The centroids 
from given dataset having capabilities of 
distinguish between similar and dissimilar 
data or classes. Therefore triangle area 
represents the new features for evaluating 
similar attack. Then KNN classifier used 
features of triangle area to detect attack. 

III. MULTIVARIATE 

CORRELATION ANALYSES IN 
DETAIL 

Incoming network attack traffic treats 
diversely from normal network traffic and 
those are reflected by statistical properties. 
To describe such properties MCA technique 
is used. The network attack traffic causes 
changes to the correlation of features. So 
that such changes can be used for identify 
malicious behavior. 

The MCA employs triangle area for 
extracting the geometrical correlation 
between features of incoming network 
traffic. All triangle areas are arranged on the 
map. The values of the diagonal elements 
are set to zero because only considering 
correlation between each pair of distinct 
features of incoming traffic. Then 
comparing two TAMs along with their main 
diagonal. To make immediate comparison 
between two TAMs choose either upper or 
lower triangle of TAM. 

The advantages provided by the 
MCA technique are as follows: 

1. It does not require the knowledge 
about historical traffic to perform 
analysis. 


2. It provides the characterization for 
individual network traffic record. 

Given an arbitrary dataset 
x T = [x T1 ,x T2 ,x T3 ,....,x Tn ] 

Where x T = L/i ./ 2 ./ 3 1 . ■■■ ■ >fm\ represents ith 
m- dimensional record. Where f/ is the 
value of the 1th feature in the ith traffic 
record. To achieve the triangle build by 
using two features. For analysis purpose all 
possible permutations of any two distinct 
features in the vector X are extracted. With 
the help of those features triangle areas are 
formed. 

IV. PROPOSED SYSTEM 
A. Architecture 

For accurate network traffic characterization 
our system uses Multivariate Correlation 
Analysis (MCA) .In decision making system 
incoming requests are compared with the 
normal profile. In this system there are three 
types of actors, such as user, expert user and 
admin. So, when user wants service of 
system, he sends a request. The expert user 
accepts the request and then provides 
services to the user. The admin having the 
capabilities that he is able to know this all 
interaction between user and expert user. 
When user trying to send the request 
containing malicious data then system 
recognize the attacks in it. Admin able to 
block the respective user. In this system 
every single request is detected by 
comparing with normal profiles. 
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Fig.l. Architecture of dos attack detection 
system 

B. Modules 

1 ) Network Traffic 

Capturing the packets from incoming 
network traffic. Instead of monitoring 
malicious traffic, concentrate on related 
inbound traffic. 


3 ) Decision Manager 

The anomaly based detection method is 
employs in Decision Making and 
responsible to filter the malicious data and 
traffic data. It detects DOS attacks without 
having relevant knowledge. 

4) Normal Profile 

In the Training Phase the normal profiles 
build for various types of legitimate 
requests, and the generated normal profiles 
are stored in a database. 

5) Detection Phase 

In the test phase The Tested Profile builds 
for the purpose of observing the individual 
requests coming from MCA module. Such 
observed requests are then sent for attack 
detection. The Attack Detection module 
compares tested profiles with the respective 
stored normal profiles. If the dissimilarity is 
more than the expected threshold, then 
particular request is recognized as malicious 
request and that user is blocked by the 
system. Else recognize as a normal and 
provide the requested service. 

C. Normal Profile Generation 

Algorithm 


2) MCA 

Multivariate Correlation Analysis contains 
Triangle Area Map Generation module. The 
triangle area is used to calculate the relation 
between incoming packets. The feature 
normalization module is used to normalize 
traffic record. The malicious requests 
change the correlation. Hence, changes can 
be identifying as intrusive activity. 


1: Input network traffic of n element. 

2: Extract original features from individual 
records. 

3: Apply the technique of triangle area to 
extract the geometrical correlation between 
the extracted features in the vector x. 

4: Normal profile generation. 

i. Generate triangle area map of each 
record. 

ii. Generate covariance matrix. 

iii. Calculate MD between legitimate 
records TAM and input records TAM. 


436 


https://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 









(IJCSIS) International Journal of Computer Science and Information Security, 

Vol. 14, No. 6, JUNE 2016 


iv. Calculate mean. 

v. Calculate standard deviation. 

vi. Return pro. 

5: Attack Detection. 

i. Input: observed traffic, normal profile 
and alpha. 

ii. Calculate MD between normal profile 
and incoming traffic. 

iii. If MD < = threshold 
Detect Normal 
Else 

Detect attack. 


D. Mathematical Model 


Input: Incoming network traffic record. 
System S = {NP, MD, TD, T} 

Where, 

S be the system 
NP = Normal Profile 
MD = Mahalanobis Distance 
TD = Traffic Detection 
T = Threshold 

Output: Traffic Detection 

TD = {AT, NT) 

Where, 

AT = Abnormal Traffic 
NT = Normal Traffic. 

E. Features of proposed System 

The DoS attack detection system presented 
in this paper employs the principles of MCA 
and anomaly-based detection. They equip 
our detection system with capabilities of 

1. Detect of known and unknown attacks 
respectively. 

2. To speed up the process of MCA. 


3. Eliminate bias from raw data by using 
normalization technique. 


F. Detection rate and false positive 
rate of normalized data 

Threshold 

Iff 1.5ff 2 ff 2,5ff 3ff 

FPR T93% U9% 063% 060% 0.58% 

DR 100.00% 99.83% 99.68% 99.68% 93.35% 

Accuracy 99.95% 99.81% 99.67% 99.67% 93.50% 

As shown in above table, the threshold 
controls the degree of the dissimilarity, 
which is accepted by the system, between a 
test object and the respective learnt normal 
profile. If the dissimilarity is beyond the 
determined threshold, the test object is 
classified as an attack. On one hand, it can 
be seen clearly from Table that a better FPR 
is achieved when a greater threshold is 
accepted. On the other hand, greater 
thresholds produce lower DRs. 


G. Experimental Details 

In this system there are 3 actors such as user, 
expert user and admin. The admin handles 
the overall server providing services. The 
user sends request to access the services 
from the system. The expert users, then 
accept the request and then reply with the 
expected service. Suppose if the user trying 
to send the malicious request, then system 
detect as an attack else normal. The admin 
has the ability that he can block that 
particular user 

Figure 2 and 3 describes about user 
registration and successful login 
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respectively. Figure 4 shows user able to 
send requests to the system. As shown in 
figure 5 traffic is captured from the network. 
Figure 6 shows an administrators dash board 
which contained detailed information about 
normal user’s requests and expert user’s 
details. Attack detected by the system when 
incoming traffic deviates from normal 
profiles as shown in figure7. 



Fig. 2. User Registration 



Fig. 3. User Login 
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Fig. 4. User Profile 
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Fig. 5. Incoming Traffic 


Fig. 7. Attack Detected by System 



Fig. 6. Admin Profile 


V. Conclusion 

This MCA-based DoS attack detection 
system is powered by a triangle-area based 
MCA technique and an anomaly-based 
detection technique. This technique extracts 
geometrical correlations hidden in individual 
pairs of two distinct features within the each 
network traffic record, and offers more 
accurate characterization for network traffic 
behaviors. The normalized data provides 
more accuracy in detection. This system is 
able to distinguish between known and 
unknown DoS attacks from proper network 
traffic. 
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Abstract- SQL Injection vulnerability takes 
advantages of the poorly coded web application 
and exploits the sensitive and critical 
information stored in an application’s database 
by compromising the authentication logic of the 
database server. In Most of the web applications 
user inputs in the dynamic web pages are the 
vulnerable points for SQL injection attack. A 
Single detection tool cannot handle the 
sophisticated injection attacks by the intelligent 
hackers. The proposed hybrid model with 
SQLI-Reju Venator on an Application Program 
Interface is tested and proved as an efficient 
technique to detect and prevent SQL injection. 
In this architecture, the malicious queries are 
blocked and an alert message is generated if the 
injection is detected. Only the benign query is 
allowed to access the data from the backend 
database server. The Unique identity created by 
the template creator application, the 
Reju Venator module and evaluation engine are 
significant features of the proposed model to 
prevent the Injection attack and can facilitate 
better availability of the application. 

Keywords - Authentication; Injection; Vulnerability; 
Hackers; Detection ; Rejuvenation; 

1. INRODU CTION 

Most of the SQL injection 
vulnerabilities occur when the on line application 
does not validate the input entries accepted through 
web forms, cookies and other input parameters 
.These type of attacks by-pass the authentication 


logic, breaks the confidentiality of the database. 
Vulnerability scanners, Similarity measures and 
many other automated tools for verifying SQLIA 
statically in source code and dynamic validation 
during execution are also available within the web 
applications. A single tool or procedure cannot 
effectively handle the upcoming sophisticated 
attacks. SQL Injection has been an issue for many 
years, and there are enough researches carried out 
to tackle this situation yet the risk rate of SQL 
injection is increasing exponentially in proportional 
to the behavior and size of online business 
applications!!]. Most of the commercial 
applications are offering closer interactions to its 
users or visitors than earlier in order to be 
competent, these features should be implemented 
with appropriate security measures. As there are 
fully automated injection tools available with the 
intelligent hackers they are discovering better 
methods and services that are susceptible to SQL 
Injection attack, which can execute the malicious 
injection even in the old application. Most of the 
available validation tools required source code 
modification which is a tedious task and will affect 
the performance of the underlying web application 
and the storage requirement is also high [2]. 

II. MOTIVATION 

Web application features like logon 
pages, contact forms, search function, feedback 
fields and the functions used for delivery of 
dynamic contents are all susceptible to SQL 
injection attacks even after the implementation of 
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model checking, Vulnerability scanners and 
firewalls. There should be an effective mechanism 
to handle this situation[3]. As the popularity of web 
applications demand better user interactions for the 
routine services, the sophistication of attacks is also 
growing proportionally and there is a requirement 
for a stronger method to prevent any kind of 
exploits on the sensitive information through the 
vulnerable points. Most of the existing SQLIA 
prevention approaches target only a subset of 
SQLIA attack types, a single service can handle 
only a portion of the attack spectrum and only a 
few approaches are developed to handle first and 
second order injections in parallel [4]. There is high 
demand for an effective approach which can work 
under a lighter storage specification, without false 
positives and reduced time space complexity [5]. 

III. SQL INJECTION 

SQL injection vulnerability is a 
weakness in the web application source code, 
commonly occur when there is an improper 
validations on the values received from web form, 
cookie and other input parameters. SQL-Injection 
attack is one of the top listed vulnerability by 
OWASP that can be classified under immediate or 
persistent attack and mostly referred as first order 
and second order attack respectively [18] . If a 
malicious user can control the input send to an SQL 
query, where the data is interpreted as code, he may 
be getting a malicious entry to the backend server 
for manipulating the confidential data that can 
compromise all sensitive and critical information 
stored in backend database. 

A. SQL-Injection Categories 

Some of the sample Intended/legal 
queries , Input queries and detection field identified 
with the proposed template creator application by 
the users are tested in the proposed model and the 


detection field in the template creator application is 
listed as: 

ljlntended Query: SELECT username, pas sword 
FROM users WHERE lastname=$lastname 
AND firstname=$firstname; 

Injected Query: SELECT username, pass word 
FROM users WHERE lastname=$lastname 
AND firstname=$firstname AND $status IN 
(SELECT statuses from STATUS WHERE 
pid=$pid OR pname=$pname); 

Detection Fields: identified with template creator 
application: Operator, Query Type, Fields, Tables. 

2) Intended Query.UPDATE users SET 
password^ 'Nicky' WHERE id=2 and 
username='01ivia'; 

Injected Query :UPD ATE users SET 
password^ 'Nicky' WHERE id=2 and 
username='01ivia'; 

SHUTDOWN; 

Detection Fields: Number of independent Queries 

3) Intended Query:SELECT ProductName, 
QuantityPerUnit, UnitPrice FROM Products 
WHERE ProductName LIKE 'G%'; 

Injected Query:SELECT ProductName, 
QuantityPerUnit, UnitPrice FROM Products 
WHERE ProductName LIKE 'G% ' UNION 
SELECT name, type, id FROM sysobjects;- 
Detection Fields: Operator, Fields, Comment 

4) Intended Query:UPDATE users SET 
password^ 'Nicky' WHERE id=2 and 
username='01ivia'; 

Injected Query :UPD ATE users SET 
password- Nicky' WHERE id=’2’ AND 
username=’hai’ or °=”; 

Detection Fields: Operator. 

5) Intended Query: INSERT INTO users 
(username, password) VALUES ('jack',"); 

Injected Query:INSERT INTO users 
(username, password) 

VALUES('jack',T23’,(Exec(char(0x73687574646f 

776e));~); 

Detection Fields Comment. 

Most of the SQL-Injection can be categorized 
under first order and second order attacks, more 
specifically under the category of Tautology, Union 
Queries, and Piggy backed queries, logically 
incorrect queries, stored procedure, inferences and 
alternate encoding [6]. 
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IV. RELATED WORK 

There are number of code checkers and detection 
tools available to mitigate the SQL injection 
vulnerabilities in both static and dynamic run time 
queries generated through the user input by a web 
application before accessing it to the database 
server[7]. There are hybrid models available to 
detect and prevent the SQL injection in some of the 
models are having limited functionality and scope. 
AMNESIA is a model based approach which 
makes use of dynamic and static analysis to build 
the query model. Identifying the hot spot, Building 
query model, Implement the application and Run 
time monitoring are the important stages in this 
model[8]. It is a completely automated model 
where the queries breaching the model are 
considered as injection and will be blocked before 
execution at the database server. Primary limitation 
of this model is that this technique is depending on 
the accuracy of its static analysis phase. SQL- Prob 
is a proxy server based approach; it dynamically 
recognizes and removes malicious user input [9] . It 
can detect the listed, all the major categories of 
SQL Injection attacks with a very minimum 
resource utilization. Since it is a complete black 
box approach, it does not require any application 
modification or any other types of complex input 
validations. SQL-IDS are a specification based 
technique to detect the exploitation of SQL 
injection vulnerabilities [10]. In this model the 
major focus is on query specific detection and has 
negligible computation overhead. There is no 
occurrence of false positive and false negative. 
SQL_DOM works on an API with type checking 
facilities and uses call Level Interfaces. The 
encapsulation techniques are taking care of 
authentication access to the database server [11]. 
Even though there is appropriate filtering and input 
checking, the model still requires further research 
support against the upcoming sophisticated queries 


demanded by the clients of an application. The 
available tools and techniques identified are having 
its own limitations and weaknesses and cannot give 
complete assurance against the injection attack 
[ 12 ]. 

V. THE PROPOSED HYBRID MODEL 

The proposed Hybrid Model embedded on 
an application Program Interface is an effective 
mechanism to detect SQL Injection before it is 
executed by the Data base server. It is a novel 
template based approach for detection and 
prevention of various categories of SQL-Injection. 
The architecture of the proposed hybrid model is as 
shown in “Fig. 1”. 



Figure 1 . Architecture of the Hybrid model. 

A. Standard Query Template creator and 
Template IStore 

In most of the web applications each page can 
generate multiple database requests and so there 
will more than one legitimate query pattern 
corresponding to those queries. Template files can 
be created by parsing the query based on template 
specification format designed [13]. Template 
specification format is shown in “Fig. 2”. All the 
generated queries with unique identity will be 
stored at the Template repository. 
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Figure 2. Query Template specification format 

B. SQL-Template Mapper 

The proposed approach has the specific 
parsing and mapping procedure to match between 
the template fields corresponding both standard 
query and Injected query later direct the alert 
message/result to the query evaluation engine. Both 
the files are stored in JSON format. The template 
mapper retrieves template contents from the JSON 
file. Sample query format with a unique identity is 
shown in “Fig 3”. Since it is stored in a JSON 
format accessing will be faster, which is an added 
advantage. 

Inpit Qien Teinplile Cratftl wiH Generil Propeitki Plea« reftf JSON ffle ii (k paA... " SQli4Dft«DonPmeiiiion SQLL4DPDAIA diti Input?" 


[ , » , l 'Id 1 ; . : ;] , 'XOS*: [] , *!Cn:C3E': [] , ;] , . 

Kt^Se!H-225Hii-a:d-€36a-a5f<ii*:'. : 

•csis 1 ; , 'JZEY-nH* : | , selec: , ] , '=•; . [] , , 5=L'IS:- 

Figure 3 .Sample query format with a unique identity 

C. Query Evaluation Engine 

The query evaluation engine compares 
the unique Identity created for the standard query 


template and the input query template [14]. If the 
Injection is not detected in the first level it will 
refer to the second level detection. In any level a 
contrast is found, then the malicious query message 
is reported and only the benign query directed to 
the database server[15]. The SQLI-Shield, a Jar 
file, embedded in API allows matching regular 
expressions against columns which are related to 
the user input components in the corresponding 
web application. In such cases, the system procures 
this input from the input query and performs 
validation against the regular expression. The 
SQLI-Shield format can be initiated by giving the 
appropriate template Identity of a intended query as 
shown in “Fig. 4”. 

SQLIAShield shield =new S QLI AS hiel d( " D : ViS QLI AC onfi ^VT emplat e V\st_f db52 9 77 -S 2 SS - 4a7 f-S2eO- 

lbPc23e9a3d4.txt", "D:ViSQLIAConfi ^\0utput ir ) ; 


Figure 4. The SQLI-Shield initialization 

D. Query extractor class 

The Query extractor class used in Java based 
application developed for the hybrid model has 
identified the major categories of specifications as 
shown in “Table 1”. 

TABLE 1: SPECIFICATION IN QUERY EXTRACTOR 
CLASS. 


Sno 

Specification 

type/string 

extracted 

Sno 

Specification 

type/string 

extracted 

1 

Query type 

7 

Global variables 

2 

Joints 

8 

System Variable 

3 

Aggregate 

functions 

9 

Quoted String 

4 

comments 

10 

Input string 
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5 

Special 

characters 

11 

Dependent 

queries 

6 

operators 

12 

Keywords 


E. SQLI Detection Engine 

In this module the SQL Detection Engine 
evaluates the incoming queries with the 
specification template by checking possibility of 
injection and gives the alert messages. Malicious 
queries are blocked, only the benign queries are 
allowed to access the database server [16]. The 
Query Reconstruction module in the proposed 
hybrid model will reconstruct the queries by 
eliminating injections and also rebuilding missing 
portions, if any, and removing injected part of the 
user query [17]. 

F. The Query Reconstructor 

The proposed strategy is to verify the query and 
reconstruct it, if it is possible using the Application 
Program Interface, before actual execution in the 
database server . The important features in the 
proposed approach is automatic creation of 
standard query template with training dataset, 
validation of user input against regular expression 
patterns and reconstruction of injected queries [14]. 
The core of this system is a class named “SQLI- 
Shield” which can be instantiated with a 
parameterized constructor with parameter values as 
standard query template path and an output folder 
path [17]. The status report of rejuvenation process 
is as shown in “Fig. 5”. 


SdiooHrajawtH i toft kSmrUl i 

Mo: Schwiyjajwnt vu ixcwMlj diplopd is 3,2BC nlliitcoadi. 

Mo: inpuud Stoiag: HlKt ♦ ires lop: vUr» isd Pujvoid*' ' ud CjcJuoiip'ir.ovi'; 

Mo: Stated ♦ fra lojis vfatrt Bmw'topKj' ud Pui*ord*'ii8faj' ud Dar_Sut«a'ktii»'; 

Mo: I: InjKtia a SSL StrisgHHUHiMkt * fra login ten Tnmr'iihl' ud humid*' ' ud Jtejutap'ktiT*'; 

Mo: Jrntund: — hIki 4 fra login vt»rt [louH'idsi:' ud Puivord*" ud Jitrjutti* , kuv»'; 

Mo: upend Stria?: HlKt 4 fra login ten r«rsup'itos' ud hiimifr'teuiten' ud rjtrjursp'lctivi'; 

Mo: Stated QgiqHMHHtHH:Mliet 4 fra logia vfctr* Itanup'lupit _1' ud Punord*'JnpetJ’ ud OHrStittP'lctin'; 

Mo: Is Lutetian a SSI StdagHMMWtailCt * fra loga ten "ntnuH'idnu 1 ud kantfr'lttuill 1 ud 7i*r_S:atm«’ictiT» 1 ; 

Mo: h j — m fc — HlKt 4 fra loga ten taap'ifcit 1 ud Pi«vord*'»iraidsa' ud Cn:Jutu*’lttivi'; 

Mo: iaput«d String: HlKt 4 fra login ten SMruae'idiitlaii.M' ud J»sjvc:d='idna|123' ud Shi Stusi*'lct:w'; 

Mo: Stated Sui^HHMMMMutlKt 4 fra loga vfatrc Banua'Jupoy ud Puiwrd»'$upetj' ud Bwjuw'ktin'; 

Mo: Is !a]«ctas a SSI StaagWHtH:Mliet 4 fra loga vhert jsiaup'idaajail.ca' ud ?ujvcrd*'teis}123' ud Oar^Suw'ktin'; 
Mo: hgsiiuud: — HlKt 4 fra loga viicrc CHauH'itaJail.oa' ud Pujverd*’Ldia!12j' ud nitr_St*taj*'Activ« ’ ; 


Figure 5. The status report of rejuvenation process 

VI. THE SQLI_REJUVENATOR.JAR, PACKAGE 
FILE 

In order to get better system availability rather than 
rejecting the query without much validation, the 
proposed hybrid model has a special package file, 
SQLRejuvanre.jar. The constructors and functions 
used in this package file are shown in Table 2. 

TABLE 2. CONSTRUCTORS AND FUNCTIONS IN 
SQLIAREJUVINATE.JAR 


Constructor/F unction 

Description 

SQLRejuvenate(String 

standardSqlString) 

A constructor to initializes 
the standard SQL string 

SQLRejuvenate(String 
standardSqlString, String[] 
regularExpres sion) 

A constructor to initializes 
the standard SQL string 
and array of regular 
expressions for inputs 

SQLRejuvenate(File 
trainingDataSetfile, String[] 
regularExpression) 

A constructor to initializes 
the standard SQL string 
that created from training 
data and array of regular 
expressions for inputs. 

detectSQLIA(String 

inputSqlString) 

A function , detects the 
SQL injection and returns 
true if found 

validateNoOflndependentOrSub 
Queries(String inputSqlString) 

A function checks the 
number of independent or 
sub-queries. Returns true if 
no injection found. 

validateQuerytype(String 

inputSqlString) 

A function, checks query 
type in the order they 
appear. 

Returns true if no injection 
found 

validateUsedTables(String 

inputSqlString) 

A function, checks used 
tables in the order they 
appear. 

Returns true if no injection 
found 

validateColumns(String 

inputSqlString) 

A function, checks columns 
in the order they appear. 
Returns true if no injection 
found 
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validateSystemVariables(String 

inputSqlString) 

A function, checks system 
variables in the order they 
appear. Returns true if no 
injection found 

validateGlobalV ariables(String 
inputSqlString) 

A function, checks global 
variable in the order they 
appear. Returns true if no 
injection found 

validateFunctions(String 

inputSqlString) 

A function, checks 
aggregate or built-in SQL 
functions in the order they 
appear. Returns true if no 
injection found 

validate J oins (String 
inputSqlString) 

A function, checks joins in 
the order they appear. 
Returns true if no injection 
found 

validateSpecialSymbols(String 

inputSqlString) 

A function, checks special 
symbols the order they 
appear. 

Returns true if no injection 
found 

validateOperators(String 

inputSqlString) 

A function, checks 
operators used in the order 
they appear. Returns true if 
no injection found 

validateCommentSymbols(Strin 
g inputSqlString) 

A function, checks 
comment symbols in the 
order they appear. Returns 
true if no injection found 

validateKeywords(String 

inputSqlString) 

A function, checks 
keywords in the order they 
appear. 

Returns true if no injection 
found 

setInputFields(String[] 

regularExpression) 

A function sets regular 
expression for each input 
field in the order they 
appear. 

validateAllInput(String 

inputSqlString) 

A function, checks input 
field values with regular 
expressions. Returns true if 
all inputs are valid. 

detectSQLIAWithReconstruction 
(String inputSqlString) 

A function, detects SQL 
injection and returns 
reconstructed query with 
valid input values if any 
injection found. 


VII. RECONSTRUCTION ALGORITHM 

Reconstruction of query is possible by comparing it 
with Regular expression and by implementing this 
strategy the application system availability can be 
increased. The Reconstruction algorithm is shown 
in “Table 3”. 

TABLE :3 RECONSTRUCTION ALGORITHM 
Algm Reconstruction 1 (InputQuery , ValidQuery) 

1. RegularExpression[] = 
getRegularExpression( V alidQuery) ; 

2. SplitListl [] = 

getQuery Splitter( V alidQuery) ; 

3. SplitList2[] = 


getQuery Splitter(InputQuery) ; 

4. InputList[] = 
getInputExtractor(InputQuery ) ; 

5. For i = 0 to length(InputList)-l 

SplitList2 [] .remo ve(InputList [i] ) ; 

//First Case: When extra queries are inserted and 
removed. 

6. For i = 0 to length(ValidQuery)-l 

If(SplitListl [i] .isEqualTo(SplitLi 

st2[i]» 

intermediateQuery = SplitList2[i] ; 

Else 

If(i< 

length(ValidQuery)-l) Then 


length( V alidQuery) - 1 


For j = i to 


interm 
ediateQuery+= 
ValidQuery [i]; 
Next 

End If 
Return 

intermediateQuery ; 

End If 

Next 

//Second Case: When injection in Input field. 


7. For i = 1 to length(InputList) 

If(validateInputWithRegularExperssi 

on(InputList[i],RegularExpression[i])) 

Validlnput[i] = InputList[i] ; 

Else 

Validlnput[i] = NULL; 

End If 
Next 


8 . 


Temp=0; 

For i = 1 to length(InputList) 

Index= V alidQuery .indexOf(Input 

List[i]); 


Rej uvenateQuery +=inter 
mediate. subString(T emp,Index)+ 
InputList[i]; 

Temp=Index+l; 


Next 
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result shown in “Table. 5”. TABLE 5. TEST RESULTS 
OF VULNERABILITIES FOUND IN CHEAT SHEETS. 


Program 


Mode 


Proposed Template based hybrid SQLI 
detection model 


Vulnerabilities 


False positives 


Schoolmate 

http://groups.csail.mit.edu/pag/ardilla/ 


SQLI 


Webchess 

http://groups.csail.mit.edu/pag/ardilla/ 


12 


SQLI 


Faqforge 

http://groups.csail.mit.edu/pag/ardilla/ 


SQLI 


EVE 

http://groups.csail.mit.edu/pag/ardilla/ 


SQLI 


Geccbblite 

http://groups.csail.mit.edu/pag/ardilla/ 


SQLI 


http://pentestmonkev.net/cheat-sheet/sql- 

iniection/oracle-sql-iniection-cheat-sheet 


SQLI 


28 


http://ferruh.mavituna.com/sql-iniection-cheatsheet- 

oku#LineCommentAttacks 


SQLI 


http://www.easvsoft.com/developer/sql-iniection.html 


SQLI 


http://www.sqliniection.net/union/ 


SQLI 


14 


VIII. PERFORMANCE EVALUATION 

The proposed 

architecture is 

complimentary to many of 
the available model due to 
faster detection and low 
overhead on storage. Using 
the sample cheat sheet of 
test data for malicious input 
query on the web 
applications are tested on the 
Template creator application 
test result as shown in 
“Table 4”. All major 
vulnerable categories of 
malicious queries were 
identified and tested for 
SQL Injection and proved 
that there is 100 % detection 
is possible with the 
proposed model, which shows that the proposed 
system is very efficient as there are no false 
positives reported. 


TABLE 4. TYPES QUERIES TESTED WITH THE HYBRID 
MODEL 


Sno 

Type of Queries 

False 

Positives 

Detection 

1 

Tautology 

0 

100% 

2 

Union Queries 

0 

100% 

3 

Piggy Backed Queries 

0 

100% 

4 

Logically Incorrect 
Queries 

0 

100% 

5 

Stored procedure 

0 

100% 

6 

Inference 

0 

100% 

7 

Alternate Code 

0 

100% 


Performance evaluation of the research work also 
carried by executing the Injected queries shared 
within the applications mentioned in the URL and 
Cheat sheet given in the table using the Template 
creator application of the proposed model. The test 


VIII. CONCLUSION AND FUTURE WORK 

The test result shown in “Table. 4” and 
“Table. 5” proves that Proposed Architecture of the 
Hybrid model on an Application program Interface 
is an efficient and effective approach towards the 
detection of SQL Injection and prevents SQL 
Injection attack. The parsing techniques, storage 
format JSON and Jar files used in the used in the 
template creator application of the model will 
increase the efficiency of the detection technique 
and provides faster processing time. 100% 
detection is possible with this approach, If the user 
input is evaluated by assigning and matching it 
with an appropriate template Identity during the 
implementation of the web application with the 
proposed hybrid architecture. There are no false 
positives reported with the current test data and 
cheat sheet used for evaluation. Due to the 
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expanding trend of attack spectrum, 100% security 
cannot be assured. According to the WASP 
security report still there are security holes through 
which exploiting of confidential information is 
possible, which requires further research , 
development with sophisticated detection 
techniques. 
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Abstract: in this article, we will propose a 
real-time human hand gesture recognition system 
which will perform translations from the sign 
language to the common French language. The 
processes is composed by three basic steps: 

> The detection and extraction of the hand 
pattern characteristics during the image 
stream acquisition, which is obtained from 
an integrated camera. 

> The analysis process, in which the obtained 
characteristics are classified as either a 
recognized sign language gesture or an 
unclassified hand movement. Preset 
characteristics of each effective hand gesture 
are stored locally. 

^ The message-assembling phase: at the end of 
cycle of each iteration of the two previous 
steps, the obtained result is either neglected 
or concatenated with the assembled message 

so far. The message is then displayed. 


Keywords: human-machine communication , gestural 
interaction , French sign language , linked gesture 
recognition. 

I. INTRODUCTION 

R ecently, many human machine interfaces 
have been implemented to ease the user 
experience (such as keyboard, mouse, 
joystick or touch-sensing devices) and 
open possibilities for further technology 

exploitation (3D mouse, virtual reality based 
systems) and push ahead ergonomic specifications 


Tarek Frikha, Abir Hadriche 
CES Lab, REGIM Lab 
Sfax, Tunisia 


to facilitate operating these systems for specific 
user categories. 

Several experiences in operational state are proved 
their incompleteness for specific cases, which occur 
rather frequently, especially when used by people 
with special needs. 

Therefore, researches were oriented to satisfying 
more natural, specific and richer interactions, which 
is the case for gestural HMIs. 

Gestures researched were mainly head 
gestures, pupil gestures and commonly hand 
gestures. Therefore, gestural servitude was founded. 

Hand gesture is the one best and most expressive 
processing friendly human movement with its 
distinguished organ shape and wide range of 
posture possibilities. Thus, a gesture has an exact 
and rich expression. 

The hand Sign Language is the most expressive 
evidence; it indicates the robustness of the hand 
gesture expression and its importance in 
communication, which competes with Speech 
interactions in fulfilling the purpose of the 
communication. 

This type of communication, easy to be 
comprehended by humans, however it have many 
difficulties to be implemented bythe process 
technologies, needs an acquisition and processing 
tool to make it. 
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This tool should provide real time sensing via an 
external sensor (such as cameras, biometric sensors 
or digital censoring gloves). 

In addition, deaf people would not always 
dare to insist on describing their needs, expose the 
best practices to follow to ensure better and easier 
communication and especially will not insist on re- 
expressing their thoughts due to possible 
psychological issues. 

From there, we issued our idea to implement our 
hand gesture recognition system to help support 
communication for those who find obstacles within 
it. 

II. RELATED WORK 

Several applications/libraries have been 
developed to perfect the hand gesture recognition 
for the common purpose of recognizing the hand 
“object”, thus, we find a wide range of utilization in 
this objective. 

Some of the developed HMI [1] benefices 
of the electrical gloves equipped with inflection 
sensors to ensure recognizing the hand posture 
allowing freedom of positioning relatively to the 
system in the same time, offering also more 
flexibility of the hand orientation and the 
movement direction and solves radically the 
problem of left handed / right handed person 
situation. 

Others [2] propose an alternative which 
facilitate hand shape detection: the use of specially 
colored gloves to standardize the anticipated color 
of the hand rather than processing the skin color 
which is very variant and subject to many different 
conditions (therefore, especially solving the 
problem of race difference) 

In addition, there was developed the “ hand 
to machine interface device[ 3] that provides real- 
time gesture , position and orientation information. 
The key element is a glove and the device as a 
whole incorporates a collection of technologies. 
Analog flex sensors on the glove measure finger 
bending. Hand position and orientation are 


measured either by ultrasonic , providing five 
degrees of freedom, or magnetic flux sensors, which 
provide six degrees of freedom." 

Finally, there were many applications 
developed to improve and make the challenge to 
recognize the bare hand directly just by relying on 
multi-level complex algorithms, which was the 
hardest but most effective solution, just because it 
offers the most amount of ergonomics to the user, 
which again, is the main purpose of implementing 
these systems generally, and our system 
specifically. 

III. THEPROPOSEDSOLUTION 

The solution provided by this system is 
composed of three main steps, which are divided to 
sub sequences and composites. 

The first step is the acquisition phase, a 
real time image stream transfer is provided from an 
integrated live camera, which is supposed to engulf 
the gesture material (the user hand). 

The second step is the analysis and 
classification phase and it is mainly composed by 
two sub-tasks: 

The analysis: each frame of the image stream 
received was segmented then we start the extraction 
of specified characteristics. 

The classification: when having the 

characteristics on hand, the system could tell 
whether the scene is containing an impression of an 
effective (significant) human hand signal or not 
based on the specified presentcharacteristics stored 
on the integrated memory. 

In the favorable case, the system stores the 
equivalent significance and reiterates until given the 
order to pass to other message or halt (detected 
automatically in another level of the algorithms) 

The last stepconsist of assembling, correction and 
outputting the message accumulated by iterations of 
phase two. 
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Fig. 1. Hand gesture recognition system 

A. the data base 

Animage database containing the 
effective messages which the comparison 
process will use while recognizing the 
received messages. 

Same example of picture the database: 



00 oo 



<0 <d) 



<«) (D 

Fig 2. (a) the alphabet A,(b) the alphabet I,(c)the alphabet B,(d) 
the alphabet L,(e) the alphabet V,(f) the alphabet U 
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After making the segmentation of these images and 
saves in a memory. 






Fig.3. The segmented images 
B. hand shape detection 

For the hand detection, we are using the 
background subtraction method provided by the 
OpenCV Image Processing Library. 

Given the situation, the system will operate 
on the hand detection phase's main purpose is to 
identify whether or not the image is containing a 
human hand, and if it ‘is the case, what gesture is it 
performing. 

To do this, we need an algorithm that gives 
the segmentation, the shape recognition and the 
similarity detection of shapes contained in the scene 
to decide if it contains a hand or doesn’t. 


Fig. 4. hand detection 


CONCLUSION 

Till the present moment, we have 
succeeded to implementing the image characteristic 
database and the hand detection module, yet the 
comparison process is still under development and 
we are finding difficulties in calibrating the results 
of the comparison. Most of the occurrences of the 
process does not recognized, the good 
characteristics and classifies the input as an 
ineffective message. 

In the present time, we are still developing 
the system and we are trying alternative algorithms 
to surpass this issue. 


452 https://sites.google.com/site/ijcsis/ 

ISSN 1947-5500 




International Journal of Computer Science and Information Security (IJCSIS), 
Vol. 14, No. 6, June 2016 


REFERENCES 

[1] Thomas Baudel, Annelies Braffor”: GESTURE 
RECOGNITION OF HAND IN REAL 
ENVIRONMENT. 

[2] Thomas Burger “ : Automatic gesture recognition of 
spoken French language complete. 

[3] dl.acm.org.sci-hub.io/citation.cfm?id=275628 

[4] anikettatipamula.blogspot.ro. 

[5] www.mathworks.com 

[6] Nicolas Mollet, Ryad Chellali: Detection and 
interpretation of the Hand Gestures. 

[7 ]B ERR ACHED Chahrazed: hand gesture recognition 
system. 

[8] Annelies BRAFFORT: recognition and understanding 
of motion, pursuant to sign language. 

[9] Klimis Symeonidis: hand Gesture Recognition Using 
Neural Networks. 

[10] Simena86.github.io 

[11] http://www.eodeforge.eom/s/0/hand-gesture- 
recognition 


453 


https://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



International Journal of Computer Science and Information Security (IJCSIS), 
Vol. 14, No. 6, June 2016 

An Optimization Technique for Brain Tumour Recognition 

Dr. D. Rajya Lakshmi ’, Shaik Salma Begum 2 

Vice-Principal, Professor of CSE, UCEN, JNTUK Narsaraopet 1 , 

Research Scholar, CSE Dept, JNTUK, Kakinada 2 


Abstract 

In this paper, we have proposed a robust technique to detect 
and classify the tumour part from medical brain images. In 
recent times, a number of image segmentation and detections 
techniques have been proposed in the literature. But, the 
detection of brain tumour through the help of classification 
technique has received significant interest among the research 
community. By considering the above issue, here, we combine 
three different techniques such as, cuckoo search, neural 
network and fuzzy classifier to detect the tumour part 
effectively. Our proposed approach consists of four phases, 
such as, pre-processing, region segmentation, feature 
extraction and classification. In the pre-processing phase, the 
anisotropic filter is used for reducing the noise and in the 
segmentation process; K-means clustering technique is 
applied. For the feature extraction, the parameters such as 
contrast, energy and gain are extracted. In classification, a 
modified technique called Cuckoo-Neuro Fuzzy ( CNF) 
algorithm is developed and applied to detection of tumour 
region. In the modified algorithm, cuckoo search algorithm is 
employed for training the neural network and the fuzzy rules 
are generated according to the weights of the training sets. 
Then, classification is done based on the fuzzy rules generated. 
Experimental results shows that the proposed technique 
achieved the accuracy of 79.49% but existing technique 
achieved only 76.92%. 

Keywords: CNF, contrast, energy, entropy, K-Means, 
anisotropic filter, sensitivity, specificity, accuracy 

1. Introduction 

Image segmentation theory, as digital image processing has 
become an important part of people active research. It is 
pertinent to note here that Image segmentation is a sine-qua- 
non of medical image processing and finds itself extensively 
applied in manifold and varied tasks [1] [14]. In addition, 
medical image segmentation casts an amazing part in the 
treatment planning, identifying tumours, tumour volume, 
patient follow up and computer guided surgery. There is a 
flood of varied methods for performing the function of 
medical image segmentation [3]. In addition the underlying 
objective of segmentation is to segregate an image into diverse 
components possessing robust correlation with domains of 
concern in the image. As far as medical image processing is 
concerned, segmentation of MR brain image is a noteworthy 
feature as MRI is predominantly proper for brain 
investigations as it bristles with brilliance in view of its superb 
distinction of soft issues, non invasive characteristic and the 
added advantage of a high spatial resolution. As a result, 
segmentation of tissues and structures from medical images is 
treated as the foremost action in several image assessment 
techniques launched for medical diagnosis [4]. It is simply 
imprudent to contrast the Manual segmentation of the 
abnormal tissues with the hi-tech swift computing systems 


throwing open the facility to visually monitor the volume and 
locality of unwanted tissues [8]. 

Consequently, pre-processing is done to improve quality of 
image; image pre-processing involves different techniques to 
improve image quality before actual segmentation process. It 
removes irrelevant information like noise and enhances 
contrast to improve image quality. The diverse pre-processing 
functions employed include Histogram Equalization, 
Binarization, and Morphological Operations. Afterwards, the 
feature extraction procedure assumes significance involving 
crucial stages, where traits tend to be the characteristics of the 
objects forming part of an image. Feature extraction is the task 
of mining definite features from the pre-processed image. 
Nowadays, many diverse methods are employed for estimating 
texture like co-occurrence matrix, Fractals, Gabor filters, 
wavelet transform. Gray Level Co-occurrence Matrix (GLCM) 
features are extensively utilized to break-up regular and 
irregular brain tumours. GLCM is the abridged form of gray- 
level co-occurrence matrix (GLCM), otherwise termed as the 
gray-level spatial dependence matrix) [3] [9]. 

K-means clustering is an appropriate method for 
biomedical image segmentation as the quantity of clusters is 
generally identified for images of particular regions of the 
human anatomy. A number of experimenters have launched 
associated investigations into K-means clustering 
segmentation. Though a significant and noteworthy 
advancement has been made in this regard, still there is greater 
computational intricacy and the need for superfluous software 
functionality [6]. Clustering programs, like k-means and 
ISODATA, function in an unsupervised mode and have been 
performed on an extensive domain of categorization dilemmas 
[7]. For categorizing the tumour segments, physical 
classification tends to lead to manual flaws, in addition to 
relying heavily on person to person, protracted and elongated 
runtime along with non-reproducible outcomes. Therefore, an 
automatic or semi-automatic classification technique is the 
need of the hour as it tends to scale down the burden on the 
individual spectator, and also because accuracy does not 
become the casualty on account of exhaustion and mammoth 
quantity of images [3]. In respect of tumour detection, several 
schemes such as, K-NN, bayes classifier, neural network, 
fuzzy classifier are performed for automatic detection. When 
comparing with these methods, Neuro-Fuzzy is found to be 
better and this technique has been used in a lot of research 
areas. 

2. Motivation of the Proposed Approach 

Segmentation is a significant technique used in image 
processing to detect objects in an image. As same as, MRI 
Image segmentation plays a critical role in many medical 
imaging applications. In accordance with brain tumour 
segmentation and detection, numerous significant algorithms 
and methods are published in this area. Some of the recent 
related works regarding the segmentation are reviewed and its 
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limitation and application are tabulated in the table- 1. They 
developed a framework for multi-object segmentation of deep 
brain structures in medical brain images. Deep brain 
segmentation is difficult and challenging because the 
structures were small size and have significant shape 
variations. To tackle these problems, they proposed a 
template-based framework and Markov dependence tree 


methods [13], which were used to segment the deep brain 
structure. However, like most segmentation problems, tumour 
detection and quantification of brain tumour was very difficult. 
Also, A.K. Qin [14] and Tao Wang et al. [15] have developed 
a vector flow method to overcome the gradient vector flow, 
boundary vector flow, and magneto static active contour, but it 
has the limited range only. 


Table 1: Summary of related researchers 


Author 

Description 

Application 

Limitation 

Jue Wu, Albert 
C.S.Chung [13] 

Template based 
Frame Work and 
Markov dependence 
tree 

Segmentation 
of deep brain 
structures 

Large Training set 
is available, 
Difficult to do. 

Tao Wang et al.[ 15] 

Fluid vector flow 

Brain tumor 
segmentation 

Limited capture 
range, poor 
convergence 

Zafer Iscan et al. [16] 

2D Continuous 
wavelet Transform 

Segmented 
magnetic 
resonance brain 
images 

It have noise 

Minakshi 

Sharma, Dr. Sourabh 
Mukharjee [3] 

Adaptive Neuro- 
Fuzzy Inference 
System (ANFIS ) 

Brain Tumor 
Segmentation 

Does not measure 
thickness and 
volume of tumor 

Jayashri Joshi, 
Mrs.A.C.Phadke [12] 

Statistical structure 
analysis based tumor 
segmentation 

Semi- 
Automated 
MRI brain 
tumor 

segmentation 

It is difficult to do 

M. Rakesh, 
T. Ravi [6] 

color based 
segmentation 
method and FCM 
algorithm 

Detection of 
Tumor Objects 
in MRI Brain 
Images 

It have average 
speed 

Reza Farjam et al. [17] 

An approach for 
computer-aided 
detection 

Brain 

metastases in 
post-Gd Tl- 
weighted MRI 

Its only designed 
to localize small 
brain metastatic 
lesions 

Our method 

Cuckoo-Based 

Neuro-Fuzzy 

Classifier 

Brain tumour 
segmentation 
and detection 

N/A 


On other hand, for segmenting the brain tumors in magnetic 
resonance images, a technique has been proposed by Zafer 
Iscan et al. [16]. There, tumour identification was done by 2D 
Continuous wavelet transform. But, during tumour 
identification process noise occurred, which is the main 
limitation of their proposed method. Moreover they visually 
demonstrated brain metastases in post Gd-Tl -Weighted MRI 
using CAD. Reza Farjam et al. [17] developed an approach it 
designed to localize small brain metastatic lesions. Key 
problem in medical imaging was automatically segmenting an 
image into constituent heterogeneous process. Jayashri Joshi 
and Mrs.A.C.Phadke [12] have performed Semi -Automated 
MRI brain tumor segmentation. However, no completely 
automatic segmentation has yet been adopted. To solve these 
challenging problems, also, various methods proposed in the 
literature have met with only limited success due to 
complexity of feature extraction and classification. M. Rakesh 
and T. Ravi [6] have developed segmentation technique for 
brain temporising fuzzy C-means algorithm. The segmentation 
performed only average speed in their method. Minakshi 
Sharma and Dr.Sourabh Mukharjee [3] developed an approach 
for segmenting the brain tumor using Adaptive Neuro-Fuzzy 


Inference System (ANFIS) to overcome the fuzzy C-means 
algorithm. But, in their method is difficult to find the volume 
of the tumour. 

By considering the above challenges and to improve the 
tumour segmentation and classification limitations, an efficient 
approach is urgently needed. In this paper we present a new 
method for brain tumour segmentation and detection using 
cuckoo-based neuro-fuzzy classifier. The results of this 
approach are used to four efficient algorithms for automatic 
tumour detection and classification. To prove this point, totally 
four different phases are developed. Then, performance of the 
procedure is tested on different brain MRI images. The rest of 
the paper is organized as follows: Introduction of proposed 
technique is presented in 1. Motivation of the paper is 
described in section 2. Contribution is discussed in section 3. 
The proposed tumour detection and classification system is 
described in section 4. The experimental results and 
performance evaluation discussion is provided in Section 5. 
Finally, the conclusions are summed up in Section 6. 
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3. Contribution of Proposed Technique 

• The novelty and contribution of the proposed method is 
that we introduce a Cuckoo-Neuro fuzzy (CNF) classifier 
to detection of tumour region. In NFC classifier, cuckoo 
search algorithm is employed for training the neural 
network and the fuzzy rules are generated according to the 
weights of the training sets. Then, classification is done 
based on the fuzzy rules generated. 

4. Proposed Brain Tumour Segmentation and 
Detection Technique Using FCM and CNF 
Classifier 


Segmentation of brain tumours from MR images is a 
difficult task that involves a range of disciplines covering 
pathology, MRI physics, radiologist’s perception, and image 
analysis based on the intensity, shape and size. It is 
unfortunate that various critical hassles and tribulations 
habitually haunt and put roadblocks against the appropriate 
segmentation of brain tumours. As per the recent data released 
by the World Health Organization (WHO), it is estimated that 
a whopping number of over 400,000 people per annum 
invariably resort to intensive medical care for the purpose of 
treating the catastrophic and cruel brain tumours. These 
tumours, in fact, exhibit a unique tendency to appear in diverse 
ways such as in shape, size and location, and they have an 
uncanny way of gate-crashing into diverse places with 
dissimilar intensities. This has thrown open unfathomable 
challenges before the medico-community in their intensive 
efforts to locate the accurate tumour which has adversely 
affected the cells in the brain. It is high time we recognize the 
supreme significance of the precise segmentation of brain 
tumours and initiate instant and effective steps for the purpose. 
In essence, Brain tumours fall in to two different groups which 
may be classified as initial benevolent tumours that have not 
broadened their domain of destruction to other localities and 
the advanced or malignant brain tumours that have already 
cast a havoc by gradual swelling and found a way from other 
parts of the physique ultimately winning to reach the brain. 
Patients supposed to suffer severely from tumours go through 
various rigorous steps including diagnostic CT scans and MRI 
in super-speciality hospitals. Although, the radiologist 
performs these diagnoses, it is very difficult to identify a 
tumour in the brain due to the involvement of various 
abnormalities, noise and intensities. Various methods 
proposed in the literature have met with only limited success 
due to complexity of classification. Considering the above 
issues, in this paper, we have proposed a technique to detect 
and classify the tumour region from the brain MRI dataset. 



Figure 1: Overall flow diagram of the proposed tumour 
segmentation technique 


As depicted in figure 1, four main phases of the proposed 
tumour segmentation algorithm are described in the following 
section: 

> PhaseJ.: Pre-processing 

A pre-processing phase in image segmentation works is 
used to remove unwanted noise from the brain images. 
Here, anisotropic filter is utilized to remove noise 
from the MRI images. 

> Phase 2 : Segmentation of Region 

The second phase detects the region of brain images using 
K-means clustering algorithm. 

> Phase 3 : Feature extraction 

In this phase, feature parameters such as contrast, energy 
and gain are extracted using segmented regions. 

> Phase 4 : Classification 

Finally, in this phase, a Cuckoo-Neuro Fuzzy algorithm is 
developed and used to detection of tumour region. 

In the following subsections, we describe our proposed 
tumour segmentation system by first introducing the pre- 
processing steps and then detailing the tumour detection and 
classification approach. 

4.1 Pre-Processing 

Pre-processing is the standard procedure in general brain 
image segmentation, aiming to reduce image noise. Here, an 
anisotropic filter is used to smoothen the MRI image and 
reduce the noise. Each input MRI images is performed to the 
noise removal process and given to the region segmentation 
process. 

4.2 Segemenation Using K-Means Algorithm 

In this stage, region is segmented from the pre-processed 
MRI brain image by means K-means clustering algorithm. 
After we have achieved success in extracting the region, the 
recognition is carried out by means of feature extraction and 
classification technique to categorize it as either normal or 
tumour. K-means clustering segments the concerned MR 
image into two specific regions. The former region comprises 
the normal brain cells where as the second region is composed 
of the timorous brain cells. K-means clustering segments the 
brain MR image in accordance with intensity pixels 
constituting the image. K-means is considered as one of the 
significant unsupervised learning algorithms in respect of 
clusters. Clustering the image is grouping the pixels according 
to the some characteristics. It is nothing but just to cluster the 
items into k number of clusters according to certain features. 
The main target of K-mean clustering is to categorize the data 
by minimizing the sum of squares of distances between data 
and the corresponding centroid of the cluster [18] [19]. In this 
case, K-means clustering is employed to group the pixels into 
two distinct clusters ( k = 2 ).The detailed step-by-steps of K- 
means clustering algorithm is described as follows: 

1) Give the number of cluster value as k . Here, we have 
chosen/: = 2 . 

2) Randomly choose the k cluster centers. 

3) Calculate mean or canter of the cluster 

M _ ix(j)=k_'k - 12,...K (1) 

N k 
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4) Next to that, the pixels of the image are assigned to 
the closest cluster which satisfies the minimum 
Euclidean distance from the pixels values to the 
center of each cluster. 

D{i) = argmax||x ; - - M k || 2 ,i = \,...,K (2) 

5) If the distance is near to the center then move to that 
cluster. 

6) Otherwise move to next cluster. 

7) Re-estimate the center. 

8) Repeat the process until the center doesn't move. 

4.3 Feature Extraction 

Feature extraction is an important stage of image 
segmentation process and which is used to compute a 
characteristic of a digital image able to numerically describe 
its texture properties. After region segmentation, we are 
considering varying features for the tumour image 
classification. 

FV={F u F 2 ,F 3 } (3) 

These features are calculated for two segmented regions in 
each MR image such as tumour and non-tumour and the 
feature vector which we have formulated is 

FV = \f, T , F x NT , F T 2 , F 2 NT , F* , F 3 NT } ( 4 ) 

Where, F X T -> Contrast feature set of tumour region 
F } nt -> Contrast feature set of Non-tumour region 
F 2 -> Energy feature set of tumour region 
F 2 nt -> Energy feature set of Non- tumour region 
F 3 -> Entropy feature set of tumour region 

F 3 NT -> Entropy feature set of Non-tumour region 
The feature vector FV is calculated by following features: 

• Contrast : 

The contrast (C) feature is defined as the divergence moment 
of the P matrix and constitutes a significant measure of the 
contrast or alternatively the amount of local variations present 
in an image. The formula for the estimation of the contrast is 
given below: 

G-1G-1 

p\ = c = X X 17 - •/) 

i= 0 7=0 

• Energy : 

Energy (E) is generally employed to express a measure of 
information in an image. The formula for determination of the 
energy is furnished as follows: 

G-lG-l 

( 6 ) 

i= 0 7=0 

• Entropy . 

An entropy (H) measure is described as a significant statistical 
measure of randomness which is employed to distinguish the 
texture inherent in the candidate region. Moreover, entropy is 
capable of enabling us to judge the vicinity of the pixel 


appropriately leading to further precision in the categorization 
of the texture 

G-lG-l 

F 3 =H = ^ p(i, j) log 2 [p(i, j)] (7) 

;=o y=o 

The extracted features in 3.12 and the features are given to 
a cuckoo based neuro-fuzzy classifier to accomplish the 
classification process. 

4.4 Tumor Detection Using CNF Classifier 

In this section, the extracted feature set 
FV = {F‘ , F" t , Fj , F 2 nt , Fj , F" t } is given to the CNF 

classifier. In the CNF classifier, cuckoo search algorithm is 
employed for training the neural network and the fuzzy rules 
are generated according to the weights of the training sets. 
Then, classification is done based on the fuzzy rules generated. 
Section 4.4.1 describes best rule generation process using 
cuckoo search algorithm. Section 4.4.2 describes classification 
using neuro-fuzzy classifier. 

4.4.1 Best rule generation using Cuckoo search 

Our aim of this section is to generate best rules and these 
rules are provided to fuzzy logic system. Here, cuckoo search 
algorithm [20] [21] is utilized to generate best rule and these 
best rule is given to the further process. Cuckoo search 
algorithm is an optimization algorithm and developed by Yang 
and Deb in 2009 and has undergone a substantial 
development. This method is very different from other meta- 
heuristic optimization algorithm. The detailed process of the 
generating the best rules using cuckoo search algorithm is 
explained using the following section, 

> Discretization: 

Before the cuckoo search process, initially, the training 
dataset DS TR , which consists of "A" number of attributes, is 
provided to the discretization function to relocate the input 
records into a discretized one. The generalization form of the 
training dataset is expressed by: 

DSjji = {dsij ;0 <k <m and 0 <1 <n (8) 

Discretization is a vital step in data processing to transform 
the data or records into specific interval. In this case, we have 
utilized to an innovative discretization method following the 
conservative manner. The utmost and least values of each and 
every attribute are located and the T interval is traced by 
consideration the relation between the deviated value 

and7^ /7 value. 

For each and every / , deviated value is estimated as 
follows: 

Devi = Majfjdsi)-rmn(dsi) ^ 

DSi VL = min (dsi ) < (min(<A/ ) + Devi ) (10) 

DS i L = (min((A/ ) + Devi ) < (min(<A/ ) + 2 * Devi ) (11) 

DS t M = (min (dsi ) + 2 * Dev t ) < (min(^/ ) + 3 * Dev x ) (12) 

DS i H = (min (<A/ ) + 3 * Devi ) < max(<A/ )/ (13) 
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Where, VL -> Very Low, H High, M -> Medium, L-> 
Low 

Then, every value that comes under within the range is 
replaced with the interval value so that the input data is 

transformed to the decretized data DS TR . After discretization 

function, the training dataset DS TR is converted into 

discretized format DS d . Where, the entire data element 

DS d (k, Z) contain only the VL, L, M or H if T = 4 . 

> Generating initial set of nests: 

At the outset, 4 n ’ number of nests are engendered and each 
and every nest is endowed with the ensuing rules which can be 
broadly detailed as Very High (VL), High (H), Medium 
( M ), Low ( L ) and one class ( C ). Here C corresponds to 
class (whether tumour or non-tumour). The population of nest 
4 n ’ is supplied to the client along with the dimension 

(attributes) of the each and every nest R i (f i ,C) forming part 

of the image feature dataset. In other words, 4 n ’ solutions are 
furnished in a preliminary group of host nests, and each and 
every nest stands for the corresponding features. Where, f t is 
the number of features, in which 1 represents Very High 
(VL ), 2 represents High ( H ), 3 represents Medium ( M ), 4 
represents Low (L) and C -> class. The initial solution and 
solution encoding process is depicted in figure 2. 


new remedy X" t+V> for cuckoo i is produced by employing a 
Levy flight along with the following equation: 

X (t+l) = X? + a UvyW (4) 

Where, a (a > 0) symbolizes a step scaling size. This 
parameter must be connected to the scales of issue the 
algorithm is trying to locate a key to. In almost all the cases 
(X can be fixed to the value of lor a specific dissimilar 
constant. 

Bust rule generation : From cuckoo search algorithm, logical 
rules, represented as R = {Rj ;1 < j < m} are derived by 

performing several iterations. Here, the rules should have two 
different decisions such as, 1 and 2. From the cuckoo search 

algorithm, best rules R best are generated and given as figure 3: 



Figure 3: Best rules from cuckoo search algorithm 

4.4.2 Classification using Neuro-Fuzzy: 



Figure 2: Solution encoding process 


> Fitness calculation: 

We compare the outcome result with the training and testing 
dataset and we calculate the accuracy through the following 
equation (14) as fitness function for each nest. 

Fitness = 

(14) 

sum of rule Rj in the discretize d dataset DS D 
Where, 

DS D -> discretized format data 
Rj -> Rules 


> Nest updation: 

At this point, an arbitrary number (j) is created by using levy 
flight and the comparative remedy is chosen. Subsequently, 
the fitness of nest located in the initial group of nest 
corresponding to the arbitrary number is replaced by means of 
a new finest nest. When the estimation of the fitness of the 
initial remedies is over, newest remedy is found out in 
accordance with the cuckoo operator. Based on the modifiable 
Levy flight, the cuckoo operator generates new remedies. A 


Generation of fuzzy score using fuzzy system : The NFC is a 
multi-layer feed forward network which comprises the ensuing 
levels. The fuzzy interference system performs three dynamic 
functions as detailed below: 

• Fuzzification 

• Rule Evaluation 

• Defuzzification 

Fuzzy inference is the unique procedure of generating a 
mapping from a prearranged input to the resultant output by 
the employment of a fuzzy logic. Thereafter, the mapping 
heralds a foundation and from this foundation appropriate 
decisions can be taken, and the patterns can be distinguished. 
The key task of fuzzy inference involves Membership 
Functions, Logical Operations, and If-Then Rules. The 
schematic graph of the fuzzy inference system (FIS) is vividly 



Figure 4: Fuzzy Inference System Structure 


• Fuzzification 

In fuzzification process, the crusty quantities are changed into 
fuzzy. In our proposed method, the fuzzification process is 
carried out by employing the features that are extracted in 
section 4.3. 
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The extracted features are F 7 , F^ T , F 7 , F^ T , F^ and F^ T , 

for each feature we perfrom fuzzification process. For the 
fuzzification process, we collect all the 

F 7 , F] NT , f[ , F 2 iV7 , F 3 7 and F 3 NT features of the training 
images and compute each feature minimum (min) and 
maximum (max) values. The fuzzification process is 
performed following equations. 

[FL\ ] MinLimit = m in+ ^ max ~ min 

[FL\ - Max Limt + ^ max ~ min j ( 17 ) 

In above equations [FL[ ] Mln LimU and [FL] ] MaxLimit are 

the minimum and maximum limit values of the feature F \ T . 
The same equations are used for the features 
F 7 , F x nt , F 2 t , F / 7 , F 3 7 and F ^ 7 to compute the 
minimum and maximum limit values. 

• Fuzzy Membership function: 

The membership function of each and every input is 
recognized in this stage. The membership function is planned 
by selecting the appropriate membership function. One of the 
prominent challenges in all fuzzy sets involves the appropriate 
decision of fuzzy membership functions, 

1) The membership function discharges its task 
efficiently by performing the complete demarcation 
of the fuzzy set. 

2) A membership function furnishes an assessment tool 
for estimating the level of resemblance of an element 
to a fuzzy set. 

3) Membership functions may assume any shape; 
however there occur certain general patterns which 
tend to emerge in bona fide applications. 

• Rule Evaluation 

Using cuckoo search algorithm, we already generated the 
fuzzy rule set Rt>est - {R J best 'A ^ j <m-T,} that are given in 
the fuzzy rule base. The rule base contains a set of fuzzy rule 
in the form of Figure 3. 

Neural network process : After the fuzzy interference process, 
the fuzzy score is generated and assigned to the neural 
network output parameter. Totally, we have assigned two 
output classes (parameter), (i) fuzzy score (ii) original feature 
set. The neural network is well trained with these extracted 
features and different number of unknown brain MRI images 
is tested. The important steps involved in neural network are 
as follows, 

Step 1: Put the input weights to every neuron except the 
neurons in the input layer. Here, 

F 7 , F { nt , F 7 , F 2 nt , F 3 7 and F^ T are the input features such 

as contrast, energy, entropy for the tumour and non-tumour 
segmented region i.e. input of the network and ( C k )output is 
the decision result from the FIS and original feature set, i.e. 
output of the network. The neural 

Step 2: The neural network is designed with six input layers, 

H i hidden layer, and two output layer. The weights and then 
added to the neural network and it is biased. 



Step 3: To the output layer the output of the activation 
function /(ln(//;)) is then broadcast all of the neurons: 

N 

0 Ci)output = r/ k +'^ j W 2n iCi(n) (18) 

n = 1 

Where ift and T] k are the biases in the hidden layer and the 
output layer. 

Step 4: Compute the error between the desired 

output (C k ) Tavget and the output (Q ) output produced by the 

feed-forward neural network, this is given by 

F v = {Ck)t arg^t - {C^)output (19) 

In equation (19) (Charge* -is the target output and 

( c k)output -is the network output 



Figure 5: Proposed neural network structure 


In testing phase, the input testing feature [Fj 5 F 2 , F 3 ] test is 
given to fuzzy interference system and corresponding fuzzy 
score is generated. This fuzzy score is given to the neural 
network. The resultant value of neural network’s output class 
is represented as Aj and A 2 , and this value is compared with 


threshold value T x . 

(Abnormal; A] > 7i 

Re suit = \ (20) 

[ Normal ; A 2 < T\ 

In this way the brain MRI images are classified into normal 
and abnormal. 


5. Simulation Results and Discussion 

This section presents the results obtained from the 
experimentation and its detailed discussion about the results. 
The proposed tumour detection and classification technique is 
experimented with the brain MRI image dataset and the result 
is evaluated with the sensitivity, specificity and accuracy. 

5.1 Dataset Description 

The dataset contains the MR images acquired from the 
internet. The proposed method was tested with different MR 
images with different shapes, sizes and intensities. A dataset of 
60 images (40 normal and 20 abnormal) has been developed to 
the test performance. This image dataset contains 60 brain 
MRI images which includes tumour and without tumour brain 
images as shown in figs 6 . The Brain image dataset are 
divided into two sets such as, (1) Training dataset (2) Testing 
dataset. To segment the brain tumour images the training 
dataset is used and to analyze the performance of the proposed 
technique the testing dataset is used. In this novel technique, 
training and testing images are altered in various ratios like 
60/40, 70/30, 80/20 and 90/10 for the purpose of testing. The 
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Figure 6 shows some of the sample MRI images with tumour images and non-tumour images 



Figure 6: Non-tumour and Tumour images 


5.2 Experimental Setup and Evaluation Matrices 

The proposed technique is performed in a windows 
machine having configurations Intel (R) Core i5 processor, 
3.20 GHz, 4 GB RAM, and the operation system platform is 
Microsoft Wnidow7 Professional. We have used mat lab latest 
version (7.12) for this proposed brain tumour detection and 
classification technique. 

The evaluation of proposed technique in different brain 
MRI images are carried out using the following metrics as 
suggested by below equations, 

Sensitivity = 
number of true positives 

number of true positives + number of false negatives 
Specificity = 
number of true negatives 

number of true negatives + number of false positives 


Accuracy = 

number of true positives + number of true negatives 
number of true positives + false negatives + true negatives + false positives 

5.3 Experimental Results 

Nowadays, manual segmentation of brain tumour from MR 
images has emerged not only as a thorny issue but also as a 
time consuming function. The proposed tumour detection 
technique is endowed with the faculty of efficiently 
segmenting a tumour once the parameters are laid down free 
of fault. The underlying objective of the proposed technique is 
targeted at facilitating the tumour detection in brain images 
irrespective of whether they are affected by tumour or not. The 
test outcomes yielded by the novel method are furnished in 
table 1. The table 1 demonstrates the original image and 
corresponding filtered and segmented images for tumour and 
non- tumour images. Table 2 depicts the three features results. 


Table 2: Segmented results of proposed technique 

Segmented Image 







Y 

s 
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Feature Extraction Results: 


Table 3: Feature extracted results of proposed technique 


Input images 


v 


A 




Contrast 

Energy 

Entropy 


0.8014 

651173 

1.2102 


0.6946 

934521 

1.4533 


0.7031 

783346 

1.1985 


0.7956 

712774 

1.3016 


0.6978 

806406 

1.2333 


0.7767 

711766 

1.2853 


0.7935 

695191 

1.2769 


0.7318 

712434 

1.1348 


0.6726 

826969 

1.1489 


0.7702 

703274 

1.2571 


0.8153 

593832 

1.1076 
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5.4 Comparative Analysis 

In this section, we will indicate sensitivity, specificity and 
accuracy achieved by the proposed brain tumour detection 
technique while segmenting and classifying the brain MRI 
images and we use three evaluation matrices for comparing the 
performance of our method CNF classifier. Also, we have 
compared against K-means with neuro-fuzzy classifier and 
proved our proposed tumor detection system is better 
performance with help of sensitivity, specificity, accuracy. 
The evaluation results of the proposed against existing 
technique graphs are figure 7 to 9. In figure 7, the proposed 
approach achieved the sensitivity of about 96.2% where 
existing approach achieved only 8% in training-testing ratio 
(70-30). In figure 8, the proposed technique achieved the 
specificity of 79.49% where existing approach achieved only 
73.53% in training-testing ratio (80-20). In figure 9, the 
proposed approach achieved the accuracy of about 79.49% 
where existing approach achieved only 76.92% in training- 
testing ratio (90-10). Totally, the proposed tumour detection 
technique is achieved better performance when compared 
existing technique. 



Figure 7: Sensitivity graph of proposed against existing 
technique 



Figure 8: Specificity graph of proposed against existing 
technique 


0.85 

;P.8 
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Figure 9: Accuracy graph of proposed against existing 
technique 


5.5 Severity Analysis 


In severity analysis, we have taken seven images, which are 
tumour images as shown in table 4. Here, we have used pixel 
based similarity matching for proposed approach tumour 
image and manual segmented tumour image. Here, we have 
used Jaccard coefficient for similarity matching which is 
following by equation (21), 


J(A,B) = 


|AflS| 

|AUfi| 


( 21 ) 


Where, A is the pixels of proposed approach tumour image 
B is the pixels of manual segmented image 


Table 4: Severity analysis for different MR images 


Jaccard 

coefficient value 
(pixel count) 


Tumor MRI images 



0.9828 


0.8766 


0.9912 


0.9622 


0.9821 


0.9607 
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6. Conclusion 

This paper proposes a method for segmenting and 
classifying brain tumour images form MRI images. The 
overall steps of proposed tumour detection and classification 
technique includes four phases namely, pre-processing, 
segmentation, feature extraction and classification. In the pre- 
processing step, the anisotropic filter is utilised for reducing 
the noise and in the segmentation process, K-means clustering 
technique is applied. For the feature extraction, the parameters 
such as contrast, energy and gain are extracted. In 
classification, a modified technique called Cuckoo-Neuro 
Fuzzy (CNF) algorithm is developed and applied to detection 
of tumour region. In the modified algorithm, cuckoo search 
algorithm is employed for training the neural network and the 
fuzzy rules are generated according to the weights of the 
training sets. Then, classification is done based on the fuzzy 
rules generated. The proposed technique was tested on the 
magnetic resonance images of the brain for tumour 
segmentation and its performance was evaluated visually and 
quantitatively. 
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Abstract — Mobile computing has grown and developed in 
recent years with huge popularity. Gadgets like Smart 
phones, Tablets, etc have become trendy by the ease of use. 
Android is more famous platform and turned out to be the 
most important target of Malware developers in precedent 
years. The malware hazard for cellular telephones is 
evaluated to increment security and usefulness of 
smartphones. Hackers and malware program developers 
are benefitted by the limited capabilities and lack of 
standard security mechanism of Android. Nowadays smart 
phones are omnipresent, i.e. they fill numerous needs such 
as data storage, personal mobile communication, 
multimedia and entertainment etc. therefore, 
implementing secure mobile connections is challenging. As 
a result, it becomes essential to have some valuable and 
probabilistic detection along with preventive mechanisms. 
Many preventive tools are available in market but current 
trend for malware security is before installing the app user 
should be able to identify possible threats. Hence we 
propose permission based mobile malware detection 
system. It has 3 components in it 1) Client 2) Server 3) 
Signature Database. In the whole analysis process, Server 
plays important role and user is warned at the end of 
analysis process whether the requested app contains 
malware or not. 

Keywords- Mobile , Android , Malware, Security, Machine 
Learning, Static Analysis. 

I. Introduction 

Mobile malware is a malicious software program with the 
main intention to damage mobile phones. It includes things 
such as virus, worms, Trojan horses, etc. Intent of this 
malicious software could be to steal confidential data, or to 
obtain root privileges. It is an overall scourge.. Study shows 
that effect of malware is deteriorating step by step 
exceptionally in banking and financial section. Thus it is 
important to study different types of malware, their impact and 
their detection techniques. Peer-to-peer networks are used 
widely and are helpless against malware. This potential 
weakness in P2P networks could permit malware to infect, so 
we should ensure a strong protection against malware attack 
using different malware detection techniques. 

Mobile computing has developed so rapidly and fast 
in last 5 years. Smartphones are used for people for online 
shopping, e-banking, online reservations. A mobile malware is 
capable to call premium numbers from contact list, steal 
confidential data and flush all the memory as well as contact 


list. Android has turned out to be the most important target of 
Malware developers in precedent years. The malware risk for 
mobile phones is estimated to increase along with the 
functionality of phones. Android's main protector mechanism 
against malicious apps is a permission based access control 
mechanism. As a result, it becomes essential to have some 
valuable and probabilistic detection along with preventive 
mechanisms. 

There are many malware detection and preventive tools 
proposed in market but our main focus is on android malware 
detection as its been constantly sharing highest market share 
and famous compared to other platforms. We will categorize 
them based on features used to analyze the app as 

1) Static analysis 

2) Dynamic analysis 

3) Hybrid approach 

4) Application Metadata. 

Latest trend in anti-malware tool shows that these features 
are when combined with machine learning algorithm gives 
better results. 

Thus we propose a novel approach for permission based 
android malware detection system which is depends upon 
static investigation. It alerts user if the app is malicious or 
benign based on which user can proceed whether to continue 
with it or not. In this system, the app will detect the label or 
category of the application and accordingly classify them. 

In this paper, Section 1 presents Introduction and 
motivation of mobile security. Section 2 describes the 
Literature Review and Related Work on mobile malware 
detection. Section 3 lists some Malware Detection Strategies 
in smartphone. Section 4 gives overview of Mathematical 
Model and section 5 presents overview of Proposed Work. 
Section 6 Experimental Results are given. In Section 7 
Conclusion and Future scope is given. 

II. Related Work 

The malware research has been begun from the year 
2005 [1]. An broad exploration has been done in this field and 
to fix the issue of mobile malware Jacques Klein et al. 
proposed a strategy called TO fold cross validation' [2]. 
Younghee Park et al. Mark Stamp presented a common and 
typical behavior of malware. They used graph clustering to 
capture such behaviors and then their proposed method 
produces graph in which they have used clustering technique 
[3]. 

Many features like the permission based features (static) 
and the API call based features are considered in order to train 
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the system and classification decision is made using 
probability theory is given by V.Natarajan et. al [4]. Talha et 
al. presented a strategy based on permissions used in an 
application. It uses static investigation and combined with 
machine learning algorithm such as logistic regression to 
detect mobile malware [5]. 

Huy Kang Kim, et al. proposed a system based on similarity 
matching of malware-driven and malware inventor driven 
information which is able to detect and classify malware in 
similar subgroups [6]. DONG Hang et al. invented a system to 
recognize malware in Android gadgets and to capture a 
malware they used streamlines Dalvik directions. This 
technique depends on simplification of instruction [7]. 

Doaa Hassana et al. found the similarity based way to detect 
the malware. In their strategy, similarity between methods is 
computed by using the normalized compression distance 
(NCD). Later with the help of either zlib or bz2 compressors 
similarity measure is computed. System is trained using the 
computed similarity score and then afterword’s results are 
predicted whether the app is malicious or benign [8]. 

Another approach to discover noxious applications is 
discovered by Yajin Zhou et al. They proposed a strategy in 
which they utilized permission based behavioral foot printing 
method to detect malware. Afterword’s they used a heuristics- 
based filtering scheme to recognize behavior of new and 
unknown malware [9]. 

Seung-Hyun Seo, et. Al invented a method to detect mobile 
malware threats to homeland security. In their proposed 
method they defined various characteristics in mobile malware 
and show mobile attack patterns which are feasible. They 
inferred a static investigation tool, DroidAnalyzer, which 
predicts possible attacks from android app [10]. B. Shapira, et 
al. found a system to discover mobile malware which is based 
on semi supervised machine learning regardless of general 
static and dynamic based analysis. [11]. 

Ping Wang, Yu-Shih Wang invented a method based on 
footprinting (signature) based analysis and they used SVM to 
detect malware. They additionally utilized a cross validation 
scheme for improving accuracy of malware detection [12]. 

Karim O. Elish et al. described a method which is based on 
classification strategy which is further used to detect malicious 
android app. Results demonstrated that the strategy proposed 
is highly accurate [13]. 

Jehyun Lee et al. invented a technique for malware 
screening which incorporates method to extract a set of family 
representative binary patterns from already analyzed family 
members as a signature. In evaluation phase it classifies each 
set of variants into a malware family with prior calculation of 
similarity to the signatures. This likeness they utilized 
recognizes malware as a part of their proposed strategy [14]. 
Wanqing You et al. invented a hybrid approach for mobile 
malware detection. In their proposed approach they inspected 
the program execution. The primary advantage of their 
strategy is that they utilized a hybrid approach for analysis 

[ 15 ]. 


III. Malware detection strategies in Smartphone 

A. Android Malware 

1) Adware :-At this atage we don’t know how exactly 
adware malware exists and main intentaion to acces critical 
information by phone. 

2) Info stealers :- As the name suggests primary focus is on 
stealing the information such as contact list and critical stage 
is stealing the passords. 

3) Spy phone:- These apps are very commonly found. 
Typically this app is used to spy on owners phone. 

4) SMS Trojans:- Main intention of this malware is to send 
the SMS and user is charged for the message. 

5) Banking Trojans:- These type of malwares are making 
remarkable move to the banking industry. Typically they are 
involved in banking fruds. 

6) Ransomware:- This form of mobile malware consists 
of encryptors and fake security software. 

B. Behavioral Classification 

Behavior based classification is also refereed as anomaly 
detection. In such detection strategies normal behavior of the 
malware is as mapped against established dataset of normal 
behavior. If mismatch occurs between both then it is 
considered as malware. Advantage of such system is, many 
new detections are captured which were not found earlier. 
Major disadvantage of such system is major false positives are 
confirmed. 

Following fig. 1 shows the classification of malware based on 
behavior. 
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Figure 1. Malware Behavioral classification 
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C. Frequently Used Features in Mobile Malware Detection 

There are three broad categories of features 
Such as 

1 ) Static Feature 

2 ) Dynamic Feature 

3) Hybrid Feature 

All the features can be summarized diagrammatically as below 



Figure 2. Taxonomy of Mobile Malware Features 


x=independent variable 

a=Threshold value for each application which can be different 
for each application 
a=intercept value of x 
b=dataset value limit 

Calculating Malware score 

PMS Number of malware that uses that permission /^\ 

Number of all malware V ^ 

AMS= X PMS 
Where, 

PMS=Permission malware score. 

If AMS > threshold a, then requested application can be 
malicious. 

Else if AMS < threshold a then requested application can 
benign. 

0={M,B} 

M=Malicious 

B=Benign 


IV. Mathematical Model 


Finally, calculating the accuracy of the application 


Set representations: 

Let ‘S’ be the system which predicts whether application 
contains malware or not using machine learning technique 
S = {X, I, Y, D, P} 

Where, 

X= Input to the system 
^Intermediate phase of the system 
Y= Output to the system 


X={D} 

D = Set of datasets 
D = {Da, Db} 

Da=Malicious dataset 
Db=benign dataset 

Da ={dl,d2,d3, dm} 

Db={dl,d2,d3 dn} 

m ^ (D 
n ^ cd 

I={TD,TVC,MSC} 

TD=Training the dataset 
TVC=Threshold value calculation 
MSC=Malware score calculation 

Building permission database 
P = {Set of permissions} 

Such as 

{ android.permission.CALL_PHONE, 
android.permission. SEND_SMS , 
android.permission.READ_CONTACTS, 
android.permission.CHANGE_CONFIGURATION, 
android.permission.INTERNET, N} 


Accuracy = 


TN+TP 

TP+FP+FN+TN 


( 3 ) 


The complexity of above proposed strategy is O(nlogn). 

We are classifying the malware using machine learning 
technique. Calculating threshold plays an important role here 
to compute it from proposed model above it takes less than 
100 iterations. So the time complexity of the above proposed 
model is O(nlogn). 


V. Proposed Work 

To overcome the problems caused by mobile malware, we are 
proposing a system which is based on feature selection as its 
first phase. Second phase consists of classification based on 
Logistic regression and Decision Tree algorithm and finally 
we are evaluating the performance of the system by computing 
its accuracy. All the procedure is described in below image. 



Figure 3. Feature Extraction Framework. 


D (one) > P (many) 

PGD 


Training dataset 

e a+bx 

( 1 = 

1 + e a+bx 


0) 


A. Features 

There are total 144 permissions for android Kit Kat(4.4)[5] .At 
first we need to retrieve some features (permissions) from 
APK files. These required features are fetched from 
androidmanifest.xml file. These extracted permissions we are 
storing in SQL file. We are considering the permission count 
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for detecting the malware. Figure 4 shows the how we are 
extracting and storing the permissions. 

We will discuss some sample features 

android.permission.CAMERA:- This permission is requested 
when device needs to access camera. 
android.permission. ACCESS FINE LOCATION : -This 
Permission typically grants access to precise location. 
android.permission. ACCESS WIFI STATE: - This Permission 
allows to access information about WIFI state. 
android.permission.BATTERY STATS:- This permission 
allows device to collect battery statistics. 
android.permission.READ CONTACTS:- The application can 
read users contacts. 

android.permission.INTERNET : - With this permission 
applications can open Network Socket. 


Browse Apk 

(o Infected APK 

' Non- Infected APK 

Ma^se 

List Apk Scanned 

Permissions of Apk - 



com threed bowling 
im.balysv.loop 
com alkalinelabs bandgame 
jalfonso brain games 
>m .bigduckgames flow 


Peimission Name 

Peimission ^android pefmission.ACCESS_NETWORK_ST A... 5 

Pemiission >androidpeimission.lNTERNET 

[Pennission >android. peimission .WRITE_EXTERNAL_STO... 3 
Peimission >android permission. READ_PHONE_STATE 

Peimission >com.android.vending.BILUNG 

Peimission >andioid peimission.READ_EXTERNAL_STOR... 



users associate degree analysis to each native application on an 
server device and remote applications on Play Store. This 
analysis is kept specifically on server as limited computing 
power of mobile compared to PC. 

APK Auditor Signature Database : Application's area unit 
keeps within the APK Auditor signature information along 
with the results of the analysis. APK Auditor server classifies 
applications through these permissions, supported their 
existence in malwares. Service and receiver info is neglected 
as a result of their application specific definitions and also 
keeps analysis results for every application area unit displayed. 

APK Auditor central server: The APK Auditors central server 
governs and monitors analysis method and works as a 
mediator between signature info and client whereas analyzing 
requested applications. The central server will download high 
rated applications from Play Store and analyzes them. This 
official market needs authentication associate degreed a tool 
symbol so as to transfer an application. It will set the threshold 
using logistic regression function. Afterword’s in evaluation 
phase it will predict the whether the app is malicious or 
benign. 

Proposed algorithm is using ID3 decision tree 
Steps are mentioned as bellows. 

1) Calculate the entropy of each and every attribute from the 
data set S. Dataset S contains the malicious as well as benign 
apks. Attributes considered from the dataset are malware 
score, permission count for malicious dataset, permission 
count for benign dataset Threshold value. 

For that we have to use the formulas below [16]. 

H(S ) = ~ 'Z xeX p(x)log 2 p(x) (4) 


Figure 4. An example of permission extracted from malware . 

B. Feature Extraction 

This subsection explains the broad steps which we followed to 
obtain required data from apk files. 

1 . Download and collect the malicious and benign 
applications. 

2. Decompress the applications and extract the content. 

3. Extract the permissions from the files. 

4. Build the dataset. 

C. High Level Architecture and Proposed System 

The figure 5 shows the APK Auditor is a permission-based 
malware assessment system. APK Auditor consists of 3 main 
components: 

1. APK Auditor Client, 

2. A signature Database, 

3. A Central Server that communicates with client and also 
with the signature info and handles the analysis method. 

The Fig. 5 [5] presents a summary of APK Auditor's computer 
code design. 

APK Auditor Client: APK Auditor client merely offers 
associate degree analysis request, showing whether or not the 
application is trustworthy or not. This client application lets 


Where, 

S - Data set 
X - Set of classes in S 

p(x) - The proportion of the number of elements in class x to 
the number of elements in set S. 

Then Information gain is calculated [16]. 

IG(A, S) = H(S) - Y,teTV(tx)H(t) (5) 

Where, 

H(S)- Entropy ofS 

T - Subsets created from splitting set S by attribute A such 
that S=U t. 

p(t) - The proportion of the number of elements in t to the 
number of elements in set S. 

H(t) - Entropy of subset t. 

2) Classify a subset of attributes for which information Gain 
maximum. (or ultimately entropy is minimum). 

3) Form a decision tree node which contains that attribute. 

4) Recursively performs the same for remaining subsets of 
attributes. 
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Figure 5. Software architecture for mobile malware detection. 

D. Dataset 

We have collected a malware dataset from malware repository 
called Contagio and malware. lu [18] [19]. From contagio we 
have collected 30 malware apks and from malware.lu we have 
collected 80 malicious apks. To collect benign apks we have 
used Google Play store. We have collected 75 clean apks. 

In first dataset we have successfully analyzed 80 permissions. 
In the second dataset we have successfully analyzed 140 
permissions. 


TABLE I 


Dataset 

No of APK samples 
used 

Permissions Analysed 

Dataset 1 

40 

80 

Dataset 2 

126 

140 


In the next phase when we increase our dataset and train the 
system using 140 permissions then we found accuracy is 
increased, and classification percentage comes out to be 96% 
resulting in decreasing FPR. 

Again when we evaluate the system using decision tree then 
accuracy noted here is 95% and false positive ratio is 4%. 


p 1 - 1 App_Test_Results ^ 


Browse 


Adobe Acrobat Reader_v15.3.1_apkpure.com.apk 
BlackPlayer Music Player_v2.18_apkpure.com apk 
C Programming_v3.0_apkpure.com apk 
Calculator Vault s new pin pad.vl 000.28_apkpure.com .apk 
Call Blocker Free Blacklist_v5.2.18.00_apkpute.com.apk 
File Manager Rle Transfer_v1.0.1_apkpure.com.apk 
Hide Pictures Videos FotoX_v1.5_apkpure.com.apk 
Kitchen Radio Music player_v0.84_apkpure.com.apk 


Test Apps 


Clear All ~~| 



Figure 6. Apk Evaluation Process. 


TABLE II 


Algorithm 

Accuracy 

FPR 

Logistic 
Regression 
Dataset 1 

90% 

9% 

Logistic 
Regression 
Dataset 2 

96% 

2% 

Decision tree 
Dataset 2 

95% 

4% 


E. Evaluation Setup 

We have Implemented proposed model explained in section 
IV using .net 2010 on windows 7. In training phase it will 
analyze the apk file ,and gives permission statistics. In 
Evaluation phase it will predict the about the app whether it is 
malicious or benign. Figure 6 shows the app evaluation 
process. 


VI. EXPERIMENTAL RESULTS 

Our aim is to enhance the security of the smartphones as well 
as to enhance the accuracy. In first dataset of 40 apps we have 
used 18 benign apps and 22 malicious apps. In Second dataset 
We have used 75 benign apps and 51 malicious apps. 

Table 2 shows the results, when we evaluate over the dataset 
as mentioned in Table 1. Final results are shown in the form of 
Accuracy percentage and False positive ratio. 

As mentioned in table II when we evaluate our system over 40 
dataset using Logistic Regression accuracy of the 
classification is 90% and false positive ration is 9%. 


A Performance Evaluation Criteria 


We are using machine learning technique to classify the 
malicious behavior of app. So performance Evaluation is 
based on confusion matrix. Following 4 items define the 
members of confusion matrix. 

> True Positive:- Number of correctly classified benign 
application. 

> False Positive: -No of incorrectly classified malware 
applications. 

> True Negative:- No of correctly classified malicious 
applications. 

> False Negative:-No of incorrectly classified benign 
application’s. 


Accuracy = 


TN+TP 

TP+FP+FN+TN 


( 6 ) 


> False Positive Ratio:- Percentage of wrongly 
classified malware applications. 
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fpr = ff^ ^ 


VII. Conclsion And Future Scope 

The mobile security has gained wide attention in past few 
years. There are many more techniques available to tackle the 
mobile security issue. Proper literature review and deep study 
will explore many more methods to capture the malware. 
Different methods have different accuracy impact on malware 
detection. 

In some tools we will find increased classification accuracy 
and in some tools we will find although classification accuracy 
is good FPR ratio is high. It is possible to have other methods 
than the proposed methodology in the paper. 

The primary goal of our system is to enhance security and to 
improve accuracy of the malware detection. The accuracy of 
malware detection is much more dependent on the malicious 
and benign dataset. Dataset plays important role in setting 
threshold value of computation. Many preventive tools are 
available in market but main drawback observed is False 
positive ratio is high in such tools. As we are dealing with 
permissions of apps while analyzing, privacy should be 
maintained and should not reveal the confidential information 
to each other. 

The immediate next future direction of our system is, it is 
based on static analysis so capturing 0 day malware and 
runtime behavior of the features should be further studied, 
which in turn will predict correct label. It is possible to detect 
mobile malware using other machine learning technique such 
as SVM, Naive Bays Theorem etc. Our existing system is 
scalable with such techniques and should be further studied 
with the same. 
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Abstract — Increasing amount of dependability on 
computer networks and internet services are also 
increasing intrusions. Intrusion Detection System (IDS) 
tools detect the intrusions and produce alerts. An 
automated Intrusion Response System (AIRS) is required 
to analyze the alert and trigger appropriate response to 
mitigate the intrusion without delay. In this paper, cost 
evaluation methods and response decision making 
capabilities of various AIRS models are analyzed. Various 
decision making factors that are involved in the response 
selection process are also identified and then categorized in 
response, attack and system level factors. 

Index Terms — Intrusion Response System, AIRS, 
Response selection, Response factors, Response cost. 

I. Introduction 

Network attacks or intrusions incidents are becoming 
more complex and increasing with the rapid expansion 
of the networks and services over the networks. 
According to PWC Global State of Information Security 
Survey-2016 [1], the intrusion incidents are increased by 
38% compared to the previous year. The constant 
increasing incidents required a powerful defense 
mechanism in order to overcome the organization loss. 

There are the tools available for monitoring the 
intrusion on the system called Intrusion Detection 
Systems (IDS) that generates the alerts to the network 
administrator in case any intrusion is detected. For the 
network administrator, it becomes very complex and 
tedious to respond the each alert without any delay, 
exclusively when the generated alerts are in hundreds, 
thousands or even more in numbers. So an Automated 
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Intrusion Response System (AIRS) is required to handle 
the large amount of alerts and respond appropriately 
without much delay [2], [3]. 

The goal of an AIRS is to select automatically the 
correct response for the alerts to mitigate the attack with 
less penalty cost [4]. An effective AIRS should be 
designed so that it should have adaptive and cost 
sensitive characteristics with achieving the balance 
between the response cost and the damage cost [3]. 

Various frameworks and models are proposed for the 
AIRS, which considered various factors and cost 
evaluation techniques for the selection of the response. 
In this paper, the comparative study and analysis of the 
various available AIRS models for cost evaluation and 
response selection are presented. The comprehensive list 
of factors that has been used for the decision making for 
response selection is also identified and categorized. 

This paper is organized as follows. Section II 
describes the various available models and framework 
for the intrusion response system, Section III presents 
the analysis of the various AIRS models. Section IV 
presents the discussion and section V is the conclusion. 

II. AIRS Response Selection Models 

Sven Ossenbuhl et al. [5], proposed REASSESS 
(Response Effectiveness Assessment) response selection 
model containing five phases named - input & 
configuration, alert processing, response selection, 
response execution and documentation. REASSESS 
model compares the available responses and for a given 
alert provides a most suitable response (having highest 
effectiveness). The comparison is modeled with the 
response effectiveness which is evaluated on the basis of 
positive and negative effects of a response. The negative 
impact is evaluated with the help of the importance of 
the service and the capability reduction that depends on 
the performance of system/services either due to a 
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deployment of a response measure or in the event of an 
attack. Positive effects are based on the response success 
rate which is further dependent on the ratio of the 
number of deployments of response that successfully 
mitigated the attack and the total number of deployments 
of response. 

The evaluation of model is done based on the 
following characteristics - Automatic deployment, 
Scalability, Adaptability, System independency, 
Calculation efficiency, Usability Security mechanisms 
and found the model comparatively better performing 
than the ADEPTS, CS-IRS and IE-IRS. 

They used the amount of alerts to measure the 
performance of the proposed model. According to the 
authors, there are various challenges that need to be 
focused on like determining the set of eligible responses 
for a system and the impact of an attack on the given 
system. 

Aderonke Justina et al. [6], have presented the COSIRS 
(Cost Sensitive Intrusion Response System) response 
cost assessment model by considering the three factors - 
damage cost caused by the intrusion, response cost and 
the operational cost. COSIRS contains two major 
components - Intrusion Detection System (IDS) and the 
Response Engine (RE). Further, the RE is divided into 
various sub -components - alert filter & correlation, 
response manager (RM), database (contain information 
about intrusion specification, profile and response 
actions), adaptability module, cost sensitivity evaluation 
module (CSEM) and response deployment module. 

The Intrusion Cost (the cost of damage caused by an 
intrusion i n ) is evaluated using the intrusion impact on 
the system and operational cost (cost of daily 
maintenance of various aspect of the detection system). 
Intrusion impact on the system is derived with the help 
of the set of resources provided by the system, a number 
of resources which are being affected by the intrusion, 
the severity level of the attack, security policy 
(confidentiality, integrity and availability) and the 
weight of each security policies. 

Response cost is derived using Response impact on 
the system and Operational Cost. Response impact on 
the system derived based on the number of available 
response ranks, a number of resources affected, 
resources affected by the deployed response and weight 
of each security policy. 

Natalia Stakhanova et al. [7], proposed the cost sensitive 
automated IRS model with pre-emptive and adaptive 
characteristics. The proposed model relies on the pattern 
based IDS which can represent the normal and the 
anomalous pattern of the system behavior in state- 
transition graph. The model manually associated 
response action with each known intrusive pattern in the 


abnormal graph. The preemptive response is deployed 
only if the prefix pattern (of known attack sequence) 
matched with monitored sequence with greater than the 
pre-defined probability threshold. The probability of 
occurrence of the sequence is referred as a confidence 
level. 

Confidence level = (number of Sequence-Occurrence / 
total number of sequences with this prefix ) 

From the set of candidate response, the selection of 
the appropriate response is determined based on damage 
cost (DC) and response cost (RC). They have associated 
the damage cost using prior information to each attack 
pattern. The response is selected for which the condition 
DC * confidence level>RC holds. 

The success factor (SF) and risk factors ( RF ) are 
considered for the selection of the optimal response 
action which provides maximum benefit with the lowest 
risk. Used utility theory and defined expected value (EV) 
of response r s to a sequence S as: EV(r s ) = 
(Pr succ (S)*SF) + (Pr risk (S)*( -RF)), where Pr succ (S) is the 
probability that sequence S will occur and Pr risk (S) = 1 - 
Pr S ucc(S). Based on the highest EV the optimal response 
is selected. The model achieves adaptability by adjusting 
the SF by increasing or decreasing by one for every 
success or failure respectively. 

Chris Strasburg et al. [8], presented a host based 
framework for cost sensitive assessment for intrusion 
response selection with the goal of selecting a set of 
responses so that, given a possible intrusion set 
minimizes the potential system damage. The factors 
associated with the response action evaluation are 
broadly defined into two groups called factors 
associated with intrusion damage and factors describing 
response cost. 

The intrusion damage D further categorized as: 
deployment of response due to false alarms (D FalseAlarm ), 
deployment of no response when an intrusion occurs 
(false negative alarms) (D FalseNegative ), deployment of sub - 
optimal response due to a miss-labeled alarm 
(DMisiabeiedAiarm) and deployment of response for true 
attack (D true ). 

The system damage caused by the true attack is 
defined using three components: System resources 
affected by intrusions - such as services provided by the 
system (FTP, HTTP, etc.), the resource importance for 
system confidentiality, integrity & availability and the 
weight of these factors on the security policy of the 
system. The damage cost is evaluated with the intrusion 
impact on system resources and operational cost. 

The response cost estimation is composed of three 
components - Operational cost (OC), response goodness 
(RG) and response impact on the system (RSI). So 
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Response Cost RC for applying a response r for intrusion 
I is: RC(r, I) = OC + RSI - RG 

Yu Sun et al. [9], proposed an Aggregation and Cost 
Based Automatic Intrusion Response System (AC AIRS) 
model containing the components: IDSs, Interface, Alert 
Aggregation, Response Process Unit (RPU), Response 
Actions, Response Cost, Response Policy and Response 
Log. They mainly focused on the aggregation of the 
alerts, for which the alerts are categorized mainly in four 
categories: U2R, DOS, R2L, and PROBE, which is 
further partitioned according to the attack type, 
destination IP address and interval. The aggregation is 
defined based on the similar degree of each intrusion 
incident S = A + D + I, where A, D and I are the similar 
degree of attack type name, destination IP address and 
interval respectively. Further A, D and I are described 
as: 

A = {1 and 0} for {names are identical and names are 
not identical}. D = {1, 0.5 and 0} for {IP addresses 
are identical, IP are not identical but on the same 
subnet and IP addresses are not on the same 
subnet} and I = {1, (30-T)/20 and 0} for [T<10min, 10 
min<T<30 min and T>30 min}. There will be defined a 
threshold for each of the four categories. Each incident 
alert N will be compared with the N-l alerts, so SN = 
{SI, S2, .... SN-1 } and the S M = MAX ( S N ) is 
calculated and compared with the threshold, if found less 
than considered a new alert else treated as the repeated 
alert. After aggregation process complete they have 
mentioned the response selection process by writing the 
generic rules with the condition of Dcost < Rcost , where 
Dcost and Rcost are damage and response cost 
respectively. 

Bingrui Foo et al. [10], presented automated intrusion 
response mechanism called ADEPTS, based on the 
intrusion graph called I- GRAPH, with a feedback 
mechanism for evaluating the deployed response. /- 
GRAPH models the knowledge about intrusion where 
each intrusion goal is represented by one node in the 
graph with dependency relationships and edges are 
categorized as OR, AND, and Quorum edges. 

For generating the I -GRAPH, used a semi-automated 
method called I- GRAPH Generation (PIG), which takes 
two inputs: vulnerability descriptions and system 

services description (SNet). SNet is directed graph 
(created manually); where individual services are 
represented by nodes and the edges between nodes A & 
B represent the intrusion centric channel, means if A is 
compromised then the intrusion can spread to B through 
the channel. They defined five kinds of channels: DOS 
channel, Network channel, Shared file channel, Shared 
memory channel and Super channel (the combination of 
the other channels). The second input to PIG, 


vulnerability descriptions, can be obtained from any 
common vulnerability databases, such as CERT, 
Bugtraq, CERIAS-VDB etc. 

To determine the response location they have 
proposed the response set computation based on the alert 
confidence (provided by the detector or set to one) and 
compromised confidence index (CCI) computation. CCI 
of a node represents the likelihood that the node has been 
achieved. 

To compute response set the I-GRAPH is traversed 
in reverse order of CCI computation until all reachable 
nodes are traversed at most once and during traversal, 
each node is labeled as: Strong Candidate (SC), Weak 
Candidate (WC), Very Weak Candidate (VWC) and 
Non-Candidate (NC). So, SC label on a node is a strong 
indicator that the node has been achieved, while the WC 
or VWC label indicates smaller likelihoods. After this in 
the case of conservative policy, all SC nodes that have at 
least one immediate NC parent node are chosen and 
placed in the response set, whereas in moderate policy 
all SC and WC nodes and in aggressive policy all SC, 
WC & VWC nodes are placed in the response set. 
Finally, the deployment of the response is achieved with 
the help of Response Repository, Response Control 
Center and distributed Response Execution Agents. 
Response with highest Response Index (RI - calculated 
using Effectiveness Index and Disruptiveness Index) is 
chosen to deploy by the control center. 

Zheng Wu et al. [11], presented the response decision 
model based on Analytic Hierarchy Process (AHP), 
which uses the pair-wise comparison to represent the 
relative importance of one criterion over another, 
avoiding the drawback of accurate measurement for 
influence factors in order to select the proper response 
mechanism. They described the general IRS in four 
components: Response Policy Library (RPL), Response 
Decision Module (RDM), Response Implementation 
Module (RIM) and Response Tools Library (RTL). 

They have considered the common four influence 
factors for response selection that are: Attack Restraint 
(AR), Service Maintenance (SM), Time Spending (TS) 
and Resource Consumption (RC). Developed the AHP 
model with three layers, where the root layer 
representing the response selected and the middle layer 
representing the criterion representing the various factors 
as mentioned above & the bottom layer represent the 
response alternatives options and created the relative 
matrix of “response selected”. They categorized the 
intrusion as: Information Gathering (IG), Right 

Escalating (RE), File Operation (FO) & Resource 
Depletion (RD) and presented the relation of intrusion 
and the common response in the form of the table. 
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Chengpo Mu et al. [12], presented the various response 
factors that are applicable in the response decision 
making models. They identified 15 different factors and 
classified as: Attack-related factors - Alert confidence, 
Attack type, Attack severity, Attack amount, Attack type 
amount, Attacker type & Attack time; Response-related 
factors - Response goal, Response intensity, Response 
negative impact & Response effectiveness and Target- 
related factors - The importance of resources, Exposed 
extent of vulnerability, Service capability of system & 
Policy constraint. 

Further, they have categorized the factors in Objective 
factors - which can be directly get from IDS alerts, 
vulnerability scan tools & network configuration and 
Subjective factors - which can be determined by 
administrators and experience. 

They have analyzed the response factors and 
concluded with what factors are important for the 
response time decision making and what for the response 
measure decision making. 

Chengpo Mu et al. [13], have presented Intrusion 
Detection Alert Management and Intrusion Response 
System (IDAM&IRS), which is an intrusion response 
decision-making model based on hierarchical task 
network (HTN) planning having response measure 
decision-making as well as response time decision- 
making capability with the features of self-adaptive and 
balancing of response effectiveness & response negative 
impact. HTN is a planning system that searches for a 
sequence of actions to achieve the desired goal, referred 
as intrusion response plan. 

They represented the hierarchical structure of 
IDAM&IRS response planning based on an intrusion 
scenario that is being detected and responded, response 
goal set by an administrator (such as analyze the attack, 
catch the attack, mask the attack, maximize 
confidentiality, maximize data integrity, minimize cost 
etc), response strategy corresponding to the response 
goal and response key points. Different subtasks and 
their orders in KP can produce different strategies in the 
response process. They have mentioned 13 response key 
points (KP) such as general alarm subtask PI, reinforced 
alarm subtask P2, general evidence record subtask P3, 
and weak attack block subtask P7 etc. Further, they have 
refined the response goal into various subtasks. 

The response time decision making methods are 
described for different subtasks such as, for alarm 
subtasks, evidence record subtasks and backup subtasks 
(executed at the host level) the time decision-making 
model expressed as: 


IF RI k H > RIH pi THEN P, BEGIN AND T pi = t, 
i e {1,2, 3, 4, 5, 6} 

Here, rih is the risk threshold in the host level and 

pi 

the above equation represent that when the risk ri k H 
caused by an intrusion scenario k in a host is greater than 
or equal to the threshold RIH . , then begin subtask Pi 

and take the time t as the beginning time of subtask Pi. 

The response scheme decision making methods also 
defined for various subtasks such as the response 
measure decision making process for alarm subtasks, 
evidence record subtasks, backup subtasks, block 
subtasks and counterattack. 

Wang Zeng-quan et al. [14], analyzed the various 
elements that are important for designing the adaptive 
response system. They presented the alarm matrix A = 
(a^) and a y express the number the attack type i alarmed 
to j by IDS, where row i express the attack type and 
column j express alarm type from IDS such that i>l, j<n 
and n is the number of attack types. They divided the 
matrix into four region - region X having the elements 
a ii? which means attack type i is exactly identified as 
alarm i called detection nicety rate of IDS. Region Z 
having elements ay which express the attack type i 
identified to j by error. Region Y having elements a i?n+ i 
express the number of missing alarm of IDS. Region U 
having elements a n+ y express the number of alarm of 
IDS when intrusion not happened. 

With the aid of the alarm matrix, they analyzed the 
various factors involved in the designing of IRS such as: 
analyzed the various properties of IDS like Attack 
Frequency, Alarm Frequency; analyzed the quality of 
IDS with five aspects - Detection preciseness frequency, 
failing frequency, distorting frequency, alarm reliability 
and IDS efficiency. Finally, they have done the analysis 
of various expenses and response cost evaluation. 

III. Analyzing various AIRS Models 

Analysis of the AIRS models based on damage & 
response cost evaluation, response selection process and 
the various decision making factors for the response 
selection process are presented in the table- 1. Response 
cost represents the effect of response on the system 
where as the damage cost specifies the amount of 
damage caused by the attack. Factors are identified and 
classified into three categories - Response Factors (RF), 
Attack Factors (AF) and the System Factors (SF). 
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Table-1: Analysis of various AIRS models 


s 

No 

Model 

Damage Cost (DC) and Response Cost (RC) evaluation 
techniques 

Response selection Process or 
Techniques 

Factors Considered 

1 

Sven 

Ossenbu 

hi, 

(REASS 

ESS) 

A n = (F d (e) + S(e))/2 G [0..1] 

A p = rsr G [0..1] 

F d (e) = 1 — F(s) G [0..1] 
rsr(r i? aj) = dps ri (aj)/dpt ri (aj) G [0..1] 

Here, 

A p and A n are positive and negative effects 

S(s) and F d (s) are importance of service 8 and the capability 

reduction 

F(s) performance of an entity 8 and rsr(r i? aj) is the response 

success rate for a given response p and alert aj. 

dps r i (aj) - number of deployments of response p in case of aj 

that successfully mitigated the attack 

dpt r i (aj) - total number of deployments of response p in case 

of aj 

• Based on highest response 
effectiveness 

E(r) = A p - A n G [0, 1] 

• Cost sensitive 

• Having the Learning ability 

AF 

• Nil 
RF 

• Response 
effectiveness 

• Positive 
effect/Response 
success rate 

• Negative effect 
SF 

• Service importance 

2 

Aderonk 

e 

Justina, 

(COSIR 

S) 

Damage cost (damage caused by an intrusion in) 
DC in = IS, + OCi 

Z E ,( sr i c °j) 

ST i G SR 

k * m 

Response cost RC r = IS r + OC r 
r 

• Not clearly mentioned the 
process of selecting the 
responses, instead given the 
method of evaluating the 
damage and response cost. 

• Cost sensitive 

• Although Adaptability module 
is mentioned but no method is 
clearly described for the 
calculation of the 
effectiveness of the previous 
response and further learning 
through feedback 

AF 

• Attack severity level 
RF 

• Response cost 
SF 

• Damage Cost 

• Resources affected by 
Intrusion 

• Intrusion impact on 
system resource 

• Security policy 

r — 

k* m 

Where, 

ISj and OCi represents the intrusion impact on the system 

and operational cost 

sp - resources provided by the system. 

m = number of resources which are being affected 

Ei = severity level of the attack 

j = security policy (CIA) 

CO j = weight of each security policy 

k = a normalization value 

IS r and OC r represents response impact on the system and 
operational cost 

n = total number of available response ranks, 
m = total number of resources affected, 

Sn = resources affected by the deployed response 

3 

Natalia 
Stakhan 
ova et 
al., 2007 

Confidence level = number of Sequence-Occurrence / total 
number of sequences with this prefix 
select response actions while DC*confidence level>RC 
Expected Value (EV) of response r s to a sequence S: EV(r s ) 
= (Pr succ (S)*SF) + (Pr risk (S)*( - RF )) , where Pr succ (S) is the 
probability that sequence S will occur and Pr risk (S) = 1 - 
Pr SUC c(S). 

• Based on the highest EV the 
optimal response is selected 

• Cost sensitive 

• Adaptability is achieved by 
adjusting the SF by increasing 
or decreasing by one for every 
success or failure respectively 

• Using State Transition Graph 
to model the system behavior 

• Associating response action 
with each known pattern of 
anomaly - done manually 

• Methods to evaluate Damage 
and Response cost are not 
explicitly mentioned 

AF 

• Alert Confidence 
level 

RF 

• Response Cost 
SF 

• Damage Cost 

4 

Chris 
Strasbur 
g et al., 
2009 

Damage Cost by true attack 

D tme (i k ) = SI(i k ) + OC(i k ) 

Where, 

intrusion system impact: 

Sl(i t )= 2 E(i k ,s rj )xW srj 

sr j g SR 

OC(i k ) G [0..1] 

• Selecting a single response 
based on the response 
effectiveness value RV(r, I s ) 

• Cost sensitive 

• Host based 

AF 

• Nil 
RF 

• Response Cost 

• Response goodness 

• Response success 
level 

• Previous response 
success percentage 
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Response Effectiveness Value - 
RV(r,I s ) = Y j RC(r,i)-{D(i)xSF r i x Pl xE(IDS)) 

ie I s 

Where 

RC(r, I) = OC + RSI + RG 

SF rji = Pr success (r, i) x Si eV ei 

Where 

sq = System resource 

Wsq = weight of each sq based on the security policy 
Effect of i k on system resource sq is denoted by: E(i k , sq) 


SF 

• Damage Cost 

• Resources affected by 
Intrusion 

• Intrusion impact on 
system resource 

• Security policy 

5 

Yu Sun 
et al., 
2008 
(ACAIR 
S) 

• Using Alert Aggregation based on Similarity - 

S = A + D + I 

Where 

A (attack type name)= { 1 and 0} 

D (destination IP address) = { 1, 0.5 and 0} 

I (interval) = { 1, (30-T)/20 and 0}, for {T<10min, 10 
min<T<30 min and T>30 min} 

• Responses are categorized in 
passive and active responses 

• Response Selection is based 
on generic rules with the 
condition of DC>RC 

• Methods to evaluate Damage 
and Response cost are not 
explicitly mentioned 

AF 

• Alert Confidence 
RF 

• Response Cost 
SF 

• Operating System in 
use 

• System 
Vulnerabilities 

• System Importance 

6 

Bingrui 
Foo et 
al., 2005 
(ADEPT 
S) 

Compromised Confidence Index of a node - 
f alert confidence no children 

CCI = \ f ' (CC7 . ), no det ectors 

If {f '{CCI • , alertconfi dence )) else 

max( CCI ■ ), OR edge 

f'= \ min( CCI .), AND edge 

f mean {CCI ■ 1 CCI . > t), Quorum met 

[0, quorum not met 

Here, function f represents the statistical mean, and CCf 
representing the CCI of the i th child and T is a per node 
threshold 

• Adaptive by using response 
feedback to adjust the El 

• Response selection is based 
on response index (RI) 

• Choose the response with the 
highest RI 

• Handling unknown alerts by 
reporting and applying 
general responses 

AF 

• Alert Confidence 
RF 

• Response 
effectiveness 

SF 

• System 
vulnerabilities 

• System services 

7 

Zheng 
Wuet 
al., 2008 

Based on Relative matrix - 
A = (aij) (i, j =1,2,3,...) 

aij is the comparison value of alternatives in contribution to 

one criterion as - 

if a^ = a, then a f j = 1 /a, a!=0 

if Ci is the same importance as Cj, then ay =1, a_ji = 1 

The priority of each alternative is calculated by eigenvector 
of the relative matrix 

• AHP is used to select the 
Optimal response 

• Not dependent on the accurate 
measurement of the decision 
making factors 

AF 

• Attack Restraint 
RF 

• Time Spending 
SF 

• Resource 
Consumption 

• Service Maintenance 

8 

Chengp 
o Mu et 
al., 

2010a 
and 
2010b 
(ID AM 
&IRS) 

Created Intrusion Response Plan as - 

Z = {k,'V,C,KP} 

time decision-making model - 

IF RI k H > RIH pi THEN P. BEGIN ANDT pi =t , 

i e {1,2, 3, 4,5, 6} 

• Based on hierarchical 

• task network planning (HTN) 

• Uses effective index (El) and 
Disruptive impact index 

• (DI) 

• Using response time and 
response measure decisions 

• selects the response measures 
with the highest El / DI ratio 

AF 

• Nil 
RF 

• Response time 

• Response 
effectiveness 

• Negative effect 

• Response goal 

• Response policy 
SF 

• Nil 

9 

Wang 
Zeng- 
quan et 
al, 2006 

Alarm Matrix 
A = (ay) 

a^ represent the number the attack type i alarmed to j by IDS 

• Not clearly mentioned the 
methodology of response 
selection, but work on the 
evaluation of the various 
factors 

AF 

• Alarm confidence 
Level 

• Attack frequency 
RF 

• Response cost 
SF 

• Risk assessment 
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It can be noticed from the above analysis that most of the 
models are missing attack factors in order to the selection of 
the appropriate response. Many models not explicitly 
mentioned the computation of the response selection process. 
Most of those only considered true positives (i.e., the number 
of correct responses) for checking the effectiveness of their 
approach. False positive must also be taken into account. 

It is important to know how responses for AIRS have been 
wrongly identified. The online risk assessment component is 
not tightly integrated and attuned with the response systems. 
There are no correlations between the responses in almost all 
the above mentioned models. These require the further depth 
research in order to make a complete AIRS. 

IV. Discussion 

Automated Intrusion Response Systems (AIRS) are very much 
essential in today scenario where many services and 
businesses depend on the networks and there is a rapid 
increase in the intrusions on the networks. In order to design 
the AIRS effective and efficient, it is required to identify and 
choose various decision making factors for selection of 
optimal response precisely. 

Based on the above study, various factors that can be 
considered for developing more effective AIRS is divided 
broadly into three categories as mentioned in table-2. 


Table-2: Decision making factors 


s 

No 

Category 

Factors 

1 

RF 

• Factors related 
to Response 
measured 
and/or assigned 
by the network 
administrator or 
the experts 

1 . Response Cost/Negative effect 

2. Positive effect 

3. Response effectiveness 

4. Response success rate/ Response 
goodness 

5. Previous response success 
percentage 

6. Response Time 

2 

AF 

• Factors related to 
intrusions/Attack 
and can be 
obtained from 
the IDS systems 

1 . Alarm confidence 
Level(prob ability of 
occurrence of intrusion) 

2. Attack frequency 

3. Attack severity level 

3 

SF 

• Factors related to 
the System 

1 . Service/Resource/System 
importance 

2. Service/Resource/System 
performance 

3. System Resources affected by 
intrusion 

4. Intrusion impacts on system 
resource 

5. Operating System in use 

6. Damage cost 

7. Security Policy (CIA - 
Confidentiality, Integrity, 
Authentication) 


All the models not showing the uniformity on the factors that 
should be involved on decision makings. In general, it is found 
that the Alert confidence, damage cost, response cost, 
response effectiveness and the importance of resources are 
commonly used in all kinds of decision-making models 
mentioned above. 

The methodology that the mentioned AIRS models used for 
response selection process can be categorized as cost-sensitive 
based, graph based, analytic hierarchy process (AHP) based 
and hierarchical task network (HTN) planning based models. 
Although most of the models consider the response 
effectiveness as key factors for response selection, but all the 
factors mentioned in table-2 in response category should also 
be required to be considered in order to design the most 
effective AIRS. 

V. Conclusions 

In this paper, we analyzed various AIRS models. The decision 
making factors considered in models are identified and 
categorized based on the response, intrusion and system level 
factors. It is found that mostly all the models are not 
synchronized in terms of decision making factors that they 
have considered. Therefore, a performance comparison of the 
models with each other might produce the ambiguous result. 
There are total 16 factors altogether that are identified based 
on the study that can be considered for an effective design of 
the AIRS. But one of the biggest challenges is the accurate 
measurement of these decision making factors. 

Further depth researches are required on creating the 
general and widely acceptable measurement of the response 
decision making factors. Further, it requires more research on 
the methodologies that are less dependent on accurate 
measurement of the decision making factors such as analytic 
hierarchy process (AHP). 
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Abstract — SQL Injection Attack (SQLIA) is a technique of code injection, used to attack data driven applications especially 
front end web applications, in which heinous SQL statements are inserted (injected) into an entry field, web URL, or web 
request for execution. “Query Dictionary Based Mechanism” which help detection of malicious SQL statements by storing a 
small pattern of each application query in an application on a unique document, file, or table with a small size, secure 
manner, and high performance. This mechanism plays an effective manner for detecting and preventing of SQL Injection 
Attack (SQLIA), without impact of application functions and performance on executing and retrieving data. In this paper we 
proposed a solution for detecting and preventing SQLIAs by using Query Dictionary Based Mechanism. 

Index Terms — SQL Injection Attack, SQL Injection Attack Detection, SQL Injection Attack Prevention, Query Dictionary. 

♦ 


1 Introduction 

S Tructured Query Language (SQL) [1, 2] is a standard, 
comprehensive language, based on the relational 
model, SQL includes capabilities of many functions. DDL 
statements for creating schemes and specifying data types 
and constraints. DML statements for specifying data re- 
trieves, and data modifications. SQL Language is a textu- 
al language that used on all relational database manage- 
ment systems (RDBMS), the most known and used are 
Oracle, Microsoft SQL Server, MySQL, PostgreSQL, DB2 
and SQLite 

SQL Injection Attack (SQLIA) [3] is a code injection tech- 
nique, used to attack data driven applications especially 
front end web applications, in which heinous SQL state- 
ments are inserted (injected) into an entry field, web URL, 
or web request for execution, used to gain unauthorized 
data, or to retrieve information from SQL relational data- 
base. SQLIA used most often to attack databases for re- 
trieving and extracting secret information such as credit 
card information, private information, user information's, 
and financial records. The highest risk application for 
attack is web applications, since web applications ac- 
cessed through internet and available for all internet users 
and devices, also mobile applications now are at highest 
risk for SQLIA. An application is vulnerable for SQLIA 
since the injection is legal for SQL standards, and DB en- 
gine execute it. The vulnerability exists at user inputs, in 
which they bypass validation or no validation at all and 
passed to dynamic SQL statement without validation and 
checking. If we are validating the user input, then with 
another way we are forbidden them to entering single 
and double quotes, multiple dashes, and SQL Language 
keywords in the input. 

Hackers have ability to input directly malicious queries 
via a web form or by directly insert it to the end of the 
URL or to URL variables or through HTTP headers. For 
example, if the query accepts username and passwords 


like this: 

"SELECT User_Name, User_FullName FROM TA- 
BLEJJSERS WHERE User_Name=' ' AND Us- 
er_Password- '; " 

The above query will select "UserName" from the table 
"TBL_USERS" by filtering using query search condition 
"User_Name" and " User_Password". Now we can ma- 
nipulate it by various SQL code snippets by just input 
them in User_Name and User_Password fields at web 
form or URL variables like Ahmad’ or T "= T 
When web form front end processes web form and gener- 
ates SQL statement to send it to DBMS, generated SQL 
query with above inputs will be: 

SELECT User_Name, User_FullName FROM TBL_USERS 
WHERE User_Name=' Ahmad' AND 
User_Password=" or T= T; 

Because of malicious input the query search condition is 
always true condition as the query is asking to retrieve 
User_Name and User_FullName with condition that Us- 
er_Name is Ahmad and USER_PASSWORD equal to " or 1 
= 1. We can also use SQL comments operator so SQL 
engine ignore the portion after comment operator, if Us- 
er_Name field input is Ahmad'-, this will manipulate que- 
ry search to just check condition on User_Name only, 
UNION can also lead to a successful SQL injection attack. 
Open Web Application Security Project [4] published that 
SQL Injection Attack (SQLIA) is the top one and most 
vulnerable among the top ten web application vulnerabil- 
ities. 

SQL Injection attack not limited for web applications, it 
could be on desktop applications, mobile applications. 
According to OWASP [5], according to reports on 2008 for 
SQL injection vulnerabilities, 25% of all vulnerabilities 
reported for web applications. 

In this paper we are proposing a solution for detection 
and prevention of SQL Injection Attack (SQLIA) using 
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Query Dictionary Based Mechanism, in which we will 
store all queries search portion patterns, then we compare 
query generated from web forms back end and compare 
with stored one, the result will show if form query is in- 
jected, based on result action taken. In section 2 we are 
talking about Web Application and SQL Injection attacks. 
Section 3 about Types of SQL Injection Attacks. Section 4 
about SQL Injection Attack Detection. Section 5 is a sum- 
mary of related work on SQL Injection detection and Pre- 
vention. Section 6 we are talking about our Proposed so- 
lution. In last section the conclusion. 


2 Web App and SQL Injection Attacks 


Web application is a computer application that located on 
a server and users request it using web browsers through 
World Wide Web abbreviated (WWW). Web applications 
requested using HTTP or HTTPS protocols. In early web 
application started to be static, with web technology de- 
velopment most of web applications now dynamic con- 
tent, this means its contents from a database. Client using 
browser by entering web application URL request a web 
application document by using HTTP methods "Get, 
Post, Put, Delete". Web application N-tier architecture 
consists of Presentation, Business/ Logic, and Data tiers. 
The most architectures used is 3-tier in which each layer 
can potentially run on a different machine and the three 
layers are disconnected as shown on Fig.l. 
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Fig. 1. 3-teir Web Application Architecture 
This architecture in which presentation layer exists on 
client machine which is displayed using browsers like 
Google Chrome, Mozilla Firefox, Microsoft Internet Ex- 
plorer. In addition, the ability of user for changing URL 
variables also input fields and weakness of client valida- 
tion and easy of validation bypassing allow hackers to 
use vulnerabilities of dynamic SQL queries generated at 
web application backend programming code. 

SQL Injections [6] are attacks by which an attacker makes 
changes on the structure of the original SQL query by 
inserting (injecting) additional SQL code in the input 
fields of the web form or desktop app form or on URL in 
order to acquire unauthorized access to the database. De- 
spite that vulnerabilities that drive to SQLIAs are well 
known and understood, they persist and continued to be 
available because of lack of effective solutions and tech- 
niques for detecting and preventing them. SQLIA is a 
hacking technique in which attacker makes modifications 
on SQL statements through web form or application form 
inputs or web form URL variables or hidden fields to ac- 
cess unauthorized resources. Weakness of input field and 
URL variables validation help hacker to success. Web 
application vulnerabilities is the main cause of SQL injec- 
tion, the most of these vulnerabilities are: 


A. Weakness in input validation: this the common 
vulnerability in which no input validation for web 
form input fields or URL variables, so this allow 
hacker to add SQL code easily. 

B. Generous privileges: when web application access a 
database need a user with specific privileges, for ex- 
ample privileges for reading data, modification of 
data includes insertion, updating and deleting, privi- 
leges for DDL like creating tables, dropping tables. 
So the weakness here to use a general user that have 
all privileges, so any SQL statement this DB user can 
execute. So here if attacker bypass authentication he 
gains access to all DB user privileges, for example he 
can drop any table. 

C. Uncontrolled variable size: variable sizes that un- 
controlled and generic specially the biggest domain 
of them like String, lead to an easy way for attacker 
to alter SQL query with many characters the variable 
contains. 

D. Error message; the generated error messages by 
backend server code may return to client, these mes- 
sages may contain database name, tables name and 
attributes, etc., this information help hacker to know 
the structure of database. So error messages should 
not be shown to client and should the web applica- 
tion send it to webmaster by email or audit it in a log 
file. 

E. Dynamic SQL: SQL queries that dynamically gener- 
ated on backend code, these queries generated by 
concatenating SQL where condition attributes with 
variable values from input field or hidden fields or 
URL variables. In dynamic SQL the most research 
focus since no way to prevent using it, and it should 
not infect with SQLIAs. 

F. Client-side only control: if web application web 
forms validation depends on client side only, this is 
vulnerable, since hacker can bypass validation and 
validation scripts at client can be altered by using 
cross-site scripting. 

G. Stored procedures (SP): SP is an assigned name for a 
set of SQL statements and logic of procedures that 
compiled, verified and stored in database server, 
and it controlled through database server security. 
SP is more secure than web form dynamic generated 
query. The vulnerability to use dynamic generated 
SQL statements and use database function like EXEC 
to execute generated query, in this case it is vulnera- 
ble same with web form dynamic generated query. 

H. Input Output file support: if database user has priv- 
ilege to execute input form file or output file, then it 
will allow hacker to execute any statement that out- 
put to text file or excel file, for example MariaDB and 
MySQL "SELECT INTO OUTFILE...". 

I. Multiple statements: database user privilege for ex- 
ecuting multiple statements allow hacker to use UN- 
ION and retrieve additional information, or he can 
add additional insert statement or delete statement 
or drop table statement. 

J. Sub-selects: supporting of sub-selects or sub-queries 
lead to vulnerability, so additional SQL query can be 
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added inside WHRE condition. 

There are code practices [7] should be followed to reduce 
SQLIA, the most important of these practices are: 

A. Manual Coding Practices Defense: here developer 
learn SLQIA techniques and how to prevent them on 
coding stage, these practices divides to four catego- 
ries. Using Parameterized Queries or Stored Proce- 
dures, this will reduce vulnerabilities on dynamic 
query generation by concatenating, and replace val- 
ues with placeholders (parameters) with values. Al- 
so stored procedures can check of parameters data 
types and hide query structure from attacker. And 
developers should avoid using dynamic generation 
of queries in Stored procedures. The second category 
is Escaping, which is a technique for elimination 
SQL keywords. Each Programing language or script 
language has suitable connector to DBMS and it has 
its own escaping functions embedded in their librar- 
ies, as an example MySQL connector for PHP has 
mysql_real_escape_string() function. Third category 
Data Type Validation , here developer should use suit- 
able data types and he should check and validate in- 
puts with data types. Last group is White List Filter- 
ing [8], by filtering allowed and legitimate key 
words, then check for list to accept and execute. 

B. SQL DOM: [9] the solution is an executable "sql- 
domgen", which executed with connection to data- 
base and generate a compiled Dynamic Link Library 
(DLL) file. This file used by developer to execute 
against database. DLL file contains classes refer to 
them with SQL Domain Object Model (SQL DOM). 

C. Parameterized Query Insertion: by using this tech- 
nique, SQL queries vulnerabilities is detected inside 
source code and replaced with secure parameterized 
Structured Query Language (SQL) queries. 

3 Types of SQL Injection Attacks 

There are different methods performed together or se- 
quentially depending on attacker goals. Lor an effective 
and succeeded SQLIA, attacker should add a command 
with right syntax to the original SQL query. SQLIAs [6,10] 
classified to: 

A. Tautology. 

B. Illegal/ Logically Incorrect Queries. 

C. End of Line Comment. 

D. Timing Attack. 

E. Union Queries. 

L. Blind SQL Injection Attacks. 

G. Piggy-Backed Queries. 

Lor clarifying these types of SQLIAs I will use an example 
of a web form that contains two input fields Username & 
Password and a login button as shown in Pig. 2 
In this example we use below URL 
HTTP: / / www.anydomian.com?page=login to request 
login page. We use Username "Ahmad" and Password 
"P@ssw0rd", after Ahmad click on Login button, at 
backend web form code that connects to database to veri- 
fy that Ahmad account is available and correct. If SQL 
query return "True' Ahmad will be redirected to his ac- 


count main page, if "Palse" a message will appear from 
him telling him a wrong username or password. Lor a 
more reading of code read it from [10]. Now we will dis- 
cuss the seven types "methods" of SQLIAs and show how 
an attacker access Ahmad account main page without 
knowing the correct full Account information, in our ex- 
ample, the username and password of "Ahmad" account. 


SIGN IN TO CONTINUE. 


Ahmad 

m 


P@sswOrd 

a 

Remember IMe 

Forgot your password? 

Login 


Pig. 2. Login Page 

A. Tautology 

This SQLIA attack injects to SQL query so query 
evaluated to "True" always. 

Injected Query: 

SELECT User_Name, User_PullName 
PROM TABLE_USERS 

WHERE User_Name=’ Ahmad' AND Us- 

er_Password=" or T= T; 

B. Illegal/Logically Incorrect Queries 

This type of SQLIA collect database information 
from making page return error messages from 
backend code. Attacker inject junk input to URL or 
input fields or SQL query tokens to produce syntax 
or logical errors. In our example attacker inject to 
URL variables a single quote. 

HTTP: / / www.anydomian.com?page=login / 
Injected Query: 

SELECT PAGE_LOC PROM TBL_PAGES WHERE 
Page_ID= login' 

This injection will fire a syntax error when generat- 
ing dynamic query that return location of login 
page form database and the error will show: 

Error: Invalid Query “SELECT PAGE_LOC PROM 
TBL_PAGES WHERE Page_ID=login'” 

C. End of Line Comment 

In this type of SQLIA attacker use SQL comment 
operator "- - "to ignore part from SQL query 
search. 

In our example attacker insert for Username input 
field "Ahmad'--'" and Password "12345" 

Injected Query: 

SELECT User_Name, User_PullName 
PROM TABLE_USERS 

WHERE User_Name=' Ahmad’- AND Us- 

er_Password='12345'; 

D. Timing Attack 

An inference attack. In this type attacker make tim- 
ing between web page responses. This technique 
used "IP-Then" conditional statement for queries 
injection and "WAITPOR" to make database delay 
query response by a specific time. 
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E. Union Queries 

This type attacker appends a new query to original 
one using SQL UNION keyword, so he can access 
to unauthorized data. In our example attacker can 
inject a union query to URL: 

HTTP:/ / www.anydomian.com?page=login / union 
all select UserName from TBLJJSERS' 

Injected Query: 

SELECT PAGE_LOC FROM TBL_PAGES WHERE 
Page_ID=' login' UNION ALL SELECT 
USERNAME FROM TBLJJSERS 
This injected query will return all user names 
stored in table TBL_USERS which is not authorized 
to page navigator to access to this information. 

F. Blind SQL Injection Attacks 

An inference attack, as we talked one of best code 
practices to hide error messages from shown to cli- 
ent. So in this case attacker does not have any error 
messages since developer make error to a generic 
web page error. It difficult for attacker now to 
make SQLIA but it does not impossible. Attacker 
can request True/ False requests from SQL queries 
and he could success and steal information. 

G. Piggy-Backed Queries 

In this SQLIA type, attacker use SQL statements 
delimiter Attacker append additional state- 
ment so he can execute more that query. In our ex- 
ample attacker could add another query to URL: 
HTTP: / / www.anydomian.com?page=login / ;DRO 
PTB ALE TABLE USERS 7 
Injected Query: 

SELECT PAGE_LOC FROM TBL_PAGES WHERE 
Page_ID= , login l ; DROB TABLE TABLEJJSERS 
So here in this case first SQL query is legal, but the 
second is illegal and will fire database to drop table 
TABLEJJSERS. 

4 SQL Injection Attack Detection 

There are many techniques used for SQLIA detection [2, 
7], we will present them: 

A. SQLUnitGen 

Abbreviation for "SQL Injection Testing Using Static 
and Dynamic Analysis. This technique proposed by 
Shin and fellow workers. It uses static analysis to 
track flow of user inputs for testing attacks. Most 
tools and techniques utilize "JCrasher" which is a 
tool used to obtain test cases upon generated attack 
inputs. 

B. MUSIC 

Abbreviation for "Mutation-based SQL Injection 
vulnerabilities checking". This technique proposed 
by Zulkemine. He used mutation method based on 
error checking and catching by injecting syntax er- 
rors to check if any misshapen occurred. Then by 
comparing output it can conclude if a query contains 
misshapen and vulnerabilities. 

C. SUSHI 

It is an abbreviation which stands for "string con- 


straint solver". It proposed by Fu and Li. It is a recur- 
sive algorithm that found it very help in finding 
complex SQLIAs. It Solves SLSE (Simple Linear 
String Equation) constraint in an effective approach. 

D. Ardilla 

A technique and a tool for creating SQLIA. It pro- 
posed by Kiezun and fellow workers. This tool gener- 
ates attacks as inputs and run the application for each 
attack input. So it can check and detect the SQLIA 
from generated attack inputs. 

E. String Analyzer 

Wassermann and Su proposed this technique. Their 
solution depends on a based grammar algorithm, it 
strategizes string values as context free grammar 
(CFGs) and operations based on transducers of lan- 
guage following minimization. This technique then 
labels user input strings and summarize them and 
find contexts. Then by regular languages and context 
free languages usage, it checks the security of each 
labeled string in aspect of syntax. 

F. PHP Miner 

It is a solution rather than a tool, it proposed by Khin 
Shar and Kuan Tan. This solution statically looks for 
attributes in source code, then produces models and 
flowcharts of vulnerabilities prediction. 

G. Vulnerability and Attack Injection 

A method proposed by Fonseca and fellow workers, 
the solution upon attack application by pragmatic 
SQL injection vulnerabilities. For getting more 
pragmatic results the solution used predefined col- 
lected data from actual attacks. The technique com- 
posed of two parts that work together, a tool for in- 
jection attack and another for injection of vulnerabil- 
ity. 

5 Related Work 

Deevi Radha Rani, B.Siva Kumar, L. Tar aka Rama Rao, 
V.T.Sai Jagadish, M.Pradeep [3]. They proposed a tech- 
nique that handles all SQLIAs types. The technique upon 
encryption of user information and using of stored pro- 
cedures. They apply that on users' authentication infor- 
mation (Username, Password). They encrypt user data 
with AES algorithm using 40-bit secret key. On user regis- 
tration, his info encrypted and stored as a chipper text in 
database. On user authentication, on back end code at 
login form called stored procedure with parameters 
"Username, Password, Secret Key". Stored procedure 
encrypt Username & Password using secret key, after that 
it compares the generated encrypted username and 
password with encrypted username & password saved at 
users table. This technique is not suitable for dynamic 
queries from various tables since encryption of big data 
will consume time and size. But it is very valuable for 
injection attacks on user authentication. 

Biji.K.P [11] proposed data dictionary based mechanism 
against SQLIA. The method for detecting ant prevention 
SQLIA using a combination of DDL & DML Mapping. 
She creates a new database image as a mirror from prin- 
cipal database. Mirror database contains schema structure 
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and data contents of SQL queries implemented in web 
application forms, which will be stored in parallel. She 
generates a formula which it is a combination of DDL & 
DML Mapping along with Vectorization of SQL Queries. 
The Vectorization of SQL queries stored in a new created 
tables in mirror database, for including different syntax. 
She resolves the parse tree of different generated queries. 
She monitors the detection of abnormalities among the 
queries within production database from the result of the 
output of the different generated queries. For SQLIA de- 
tection shed used two methods. Static method which is 
known as pre-generating approach. In static method de- 
velopers follow some guidelines and validation checking. 
The second method is Dynamic approach which is known 
as post-generated approach, a technique used in run time. 

It analysis dynamic or runtime generated SQL query from 
web form after user inputs or web form request. 

Inyong Lee a, Soonki Jeong b, Sangsoo Yeoc, Jongs ub 
Moond [12]. They proposed a simple, easy and effective 
technique for detecting SQLIAs based on static and dy- 
namic analysis and by taking of attribute values at 
runtime (Dynamic Analysis) and compare it with original 
one in which also removed attribute values (Static Analy- 
sis). The technique used for numeric attributes and string 
attributes. They create an algorithm for attribute values 
removal from query. Also they create a generalized 
SQLIA detection algorithm to check if the query at web 
forms is normal or abnormal in advance. 

Debabrata Kar, Suvasini Panigrahi [13] proposed a tech- 
nique for SQLIA detection using query transformation 
and hashing. Their technique to transform the original 
query parameter values "where condition parameter" 
with question mark symbol "?", and SQL keyword to 
uppercase keywords, system objects like table names and 
column names with keywords they proposed. So with 
this transformation they reduce number of different que- 
ries structure, also this will reflect on performance of 
search. They used hashing function for generating unique 
hash key, so the search will be efficient during runtime. 
The advantages of using hashing is the size of hash key 
will be smaller than the transformed query, so size need- 
ed in storage reduced. Also the same hash will be primary 
index, as they are unique, to facilitate fast and efficient 
searching at runtime. 

R.Latha, Dr.E. Ramaraj [14] proposed a technique for de- 
tection of SQLIA by replacement of query search condi- 
tion attributes string of original query used in web form 
with symbols they proposing like "PQ, GQ, STR, NUM, 
etc.". At runtime they are making a replacement of query 
search condition attributes for both the original query and 
dynamic generated query from web form after user in- 
puts. So they have now a two generated restructured que- 
ries. They compare the two restructured queries for 
SQLIA detection by measuring the distance between the 
two restructured queries using levenstein method. This 
technique satisfies both static and dynamic analysis. 

Swapnil Kharchel, Jagdish patil, Kanchan Gohad, Bharti 
Ambetkar [15]. They proposed an efficient technique and 
algorithm for detection and prevention of SQLIAs using 
Aho-Corasick pattern matching algorithm. Their pro- 
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posed technique has two phases, static phase and dynam- 
ic phase. In static phase they create a list of known anom- 
aly pattern, and SQL queries that checked by enforcing 
static pattern matching algorithm by comparing of known 
anomaly pattern list created. During runtime and using 
dynamic phase if new anomaly is occurring, then new 
anomaly will be generated and added to static anomaly 
pattern list. On new anomaly generation score calculated 
for the query, if the score greater than a determined 
threshold then the query passed to an administrator to 
analyses the query manually, if the query infected a new 
anomaly generated and added to static anomaly list. 

6 Proposed Solution 

We propose an effective solution for SQL Injection detec- 
tion and prevention without any impact on application 
functions and performance. This solution based on a Que- 
ry Dictionary Mechanism. Our solution general view fo- 
cus on: 

A. SQL query statements numbers. 

B. SQL query has UNION 

C. SQL Query where suffix pattern. 

To save this information about each query, many ap- 
proaches can be used. It could be generating a memory 
allocation at application start, so this information can be 
collected for all queries exists on the application start one 
time, or it could be collected on first query calling and 
appended to memory allocation. For memory allocation 
we propose to create application variable that contains a 
list of objects to save query information on, the allocation 
created below using C# language and ASP. NET web ap- 
plication, 
class SQLIA_DP 
{ 

public int Id { get; set; } 
public string Query_Caption { get; set; } 
public byte Query_Statemnts_Count {get; set;} 
public bool Query_Has_Union { get; set; } 
public string Query_Pattern { get; set; } 

} 

List<SQLIA_DP> Is = new List<SQLIA_DP>; 

At Global class, in Application_Start method 
we create an application variable that holds 
the Is instance of query information, the 
statement for creating is: 

Application [ "SQLIA_DET_PREV" ] = Is; 

Another approach to save query information is in a JSON 
file or in NoSQL Database for example MongoDB, the 
format as following: 

[ "Id": 1, 

"Query_Caption" :"loginfrm", 
"Query_Statements_Count": 1, 

" Query_Has_Union" : "FALSE", 

"Query_Pattern" : 

" WHEREUser_Name=ANDUser_Password=" 

] 

Another approach to save query information on any rela- 
tional database table, it could be on same application da- 
tabase or in a different database, table structure will be: 
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CREATE TABLE [dbo] . [TBL_SQLIA_DET_PREV] ( 

[Id] [BIGINT] IDENTITY (1,1) PRIMARY KEY, 
[Query_Caption] [VARCHAR] (15) , 
[Queny_Statements_Count] [TINYINT] NOT NULL, 
[Queny_Has_Union] [BIT] NOT NULL, 
[Query_Pattern] [VARCHAR] (1000) NOT NULL ); 
Another approach to save query information on XML file 
as shown in Fig. 3. 

1 |c?xml version^T.B" encoding= ’’UTF-S" ?> 

2 - < SQLI A_DET_P REV > 

3 t Cquery) 

4 <id>K/id> 

5 < Que r y_Capt i on > log i nf rm < /Qu e ry_C a p t ion > 

6 T <Query_Statements_Count>i</Query_Statemnt,s_Count> 

< Qu e ry_Ha s_U n i on >f a 1 s e < /Qu e ry_H a s_Urt ion) 

< Que ry_Patt e r n > WH E R E Us e r_N ame=AND Os e r_P a s 5 wo rd = < / Qu e ry_Patt e rn > 

9 < /query) 

10 < / SQL IA_DET_P REV > 

"Fig. 3 XML Format for SQLIA_DET_PREV 
For Query Pattern extraction, it could be generated and 
inserted manually by developers or database designers, 
or developers can use below proposed algorithm that 
automatically extract query pattern and insert it to 
SQLIA_DET_PREV list, or developers could use this algo- 
rithm at runtime. The algorithm for Query Pattern extrac- 
tion as following and as shown on Fig. 4: 

STEP 1: Take Dynamic generated SQL Query without 
values from source code. 

STEP 2: Check if UNION key word exists and 

STEP 3: Count semicolon times which represents number 

of SQL statements in Query. 

STEP 4: Split SQL Query by "WHERE" key word. 

STEP 5: If Splitted SQL Query Output Array has more 
than one item then next steps for second items in Array, if 
it has one item then next steps for First Item (One Item 
means Query does not have where statement) 

STEP 6: Remove single quote and values between from 
chosen array item. 

STEP 7: Remove each value after SQL Equal Operator "=" 
and before first Space. 

STEP 8: Remove all Spaces. 

STEP 9: Save Needed information on SQLIA_DET_PREV 
list, if Semicolon times is zero then save it one. 

So our example query "SELECT User_Name, Us- 
er_FullName FROM TABLE_USERS WHERE Us- 
er_Na m e = 1 A h m ad ' AND User_Password=T2345’" 

Does not has UNION, zero semicolon, after splitting and 
execute steps from 5 to 9, Query Pattern will be "WHERE 
User_Name=ANDUser_Password=", values saved as 
shown in Fig. 3, since semicolon times is zero, this mean 
the query consist of one statement. For "Query_Caption", 
this field can be used for query retrieve to increase search 
performance, so developer can use it the same for exam- 
ple "loginfrm" for queries in login form as shown in Fig. 3 
so I linked it with web form class which can extracted 
dynamically. 

Above extraction algorithm could be used static or dy- 
namic, depends on application and developer needs. On 
application run and after user enter the inputs send his 
request and we assume here user is an attacker and he 


injected SQL query. Query after its dynamic generation 
and before sending to database engine for execution 
should send to SQLIA_CHECK algorithm which de- 
scribed as following: 

STEP 1: Use Query Pattern Extraction Algorithm above to 
extract new dynamic generated query with parameter 
values. 

STEP 2: Create SQLIA_DP object 

(SQLI A_DP_CURRENT) . 

STEP 3: Get Query Pattern object saved at 

SQLIA_DET_PREV List, if not available it should be gen- 
erating using Query Pattern Extraction Algorithm and 
save it to (SQLI A_DP_ORIGIN) . 

STEP 4: Compare Query_Statemnts_Count on 

SQLIA_DP_CURRENT and SQLIA_DP_ORIGIN, if result 
is equal GO TO STEP 4, if not Return 1 and Exit. 

STEP 5: Compare Query _Has_Union on 

SQLIA_DP_CURRENT and SQLIA_DP_ORIGIN, if equal 
GO TO STEP 5, if not Return 1 and Exit. 

STEP 6: Compare Query_Pattern on 

SQLIA_DP_CURRENT and SQLIA_DP_ORIGIN, if equal 
Return 0 and Exit, if not Return 1 and Exit. 

Above Algorithm return value 1 means there is an 
SQLI A, so query execution should be canceled. If return 
value 0 then query is clean and it should be send to data- 
base engine for execution. 

In our example if attacker inject a query "SELECT Us- 
er_Name, User_FullName FROM TABLE_USERS 
WHERE User_Name=' Ahmad' AND 
User_Password=" or T= T; 

The generated query will be send to SQLIA_CHECK AL- 
GORITHM, the result explanation will be as following: 
STEP 1: Query Pattern will be 

"WHEREUser_Name=ANDUser_Password=or=" and no 
UNION key word and 1 statement, this info saved to 
(SQLIA_DP_CURRENT) object. 

STEP 3: Get Saved Query Pattern from List, this will re- 
turn, 1 statement, no UNION, "WHERE- 

User_Name=ANDUser_Password=" and saved to 
(SQLIA_DP_ORIGIN) object. 

STEP 4: Compare result is equal Go to Step 5 
STEP 5: Compare result is equal Go to Step 6 
STEP 6: Compare Query Pattern is not Equal, Algorithm 
return 1 so there is an SQLIA and Query does not for- 
ward to database engine. 

7 Conclusion 

In this paper we have presented an effective SQL Injec- 
tion Attack detection and prevention without any impact 
in application functions and performance. Our proposed 
solution used static and dynamic approaches. Easy to 
implement by developers and database designers or de- 
velopers. Our solution detects all types of SQLIAs. Upon 
application needs or and developer experience or and 
application sensitive degree it could be implemented for 
part of queries or for all queries, it could be implemented 
static or dynamic. Query information extracted could be 
stored in encrypted manner to make the solution more 
secure. As a future work we could implement our solu- 
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tion and calculate performance issues and compare it 
with other solutions. 




Fig. 4 Query Pattern Extraction Algorithm 
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Abstract — Most of intrusion detection researches suffer from the 
following drawbacks: Dependencies between network nodes and 
cluster-like behavior of anomalies. Hence, this paper proposes a 
cluster-based approach in which the anomalies are detected using 
a new criterion related to the behavior of attacks. In addition, we 
provide a cluster-based data set which uses the flow-based data 
and graph properties to model the network traffic over time. The 
data set is built over the DARPA. Moreover, the anomalies are 
revealed by means of a criterion which is computed from internal 
and external weight of clusters. Finally, the proposed approach is 
evaluated and compared to other approaches. The evaluation 
results show the preference of our approach relative to other 
ones. 

Keywords- Anomaly; DARPA data set; flow; graph clustering; 
intrusion detection 

I. Introduction 

Intrusion detection systems (IDS) are divided into packet- 
based and flow-based categories. In packet-based IDSs, all of 
network packets which are passing through a desired point are 
collected and analyzed. Basically, a packet consists of two 
parts. One part is packet header that is including the 
information about packet source and destination and the other 
part is the content of a packet which includes data. In these 
IDSs, both of these parts are investigated to detect the 
anomalies. In contrast, the flow-based IDSs are based on the 
network flows. One of the important properties of this method 
is that it doesn’t include the content of the packet, however, it 
contains just the information such as the source and destination 
addresses. Therefore, flow-based IDSs increase the speed of 
intrusion detection process and are suitable for high speed 
networks that it solves the scalability problem in the network 
security [1], [2], [3]. 

In more specific, one of the approaches for solving the 
scalability problem in packet-based IDSs is packet header 
extraction approach. Mahoney et al. [4] proposed an approach 
by means of the packet header extracting such that the anomaly 
is detected using the normal values learning for each packet 
header. In addition, Manandhar et al. [5] proposed an approach 
that used the traffic data. They checked the information of 
packet header for anomaly detection. Also, Karimpour et al. [6] 
proposed a flow-based clustering algorithm to detect attacks in 


DARPA data set. They used some proper time intervals and 
threshold points to reveal the attacks in high accuracy. 

As comparing of these two methods, the flow-based IDS is 
more suitable than packet-based one in high speed networks. 
Moreover, in these approaches, flow data are analyzed instead 
of the contents of packets. In 2010, Sperotto et al. [7] devised 
an approach based on the flows in the network and used the 
time series to reveal the attacks. In this study, the performance 
of flow-based IDS in comparison with the packet-based one in 
the network is proved and a data set is proposed to evaluate the 
flow-based IDSs. 

Further, Hellemons et al. [8] proposed another method that 
is based on the flow concept in 2012. This research includes 
two parts. In first part, a high performance algorithm is 
designed for intrusion detection. In second part, a prototype of 
the IDS has been implemented. In fact, the authors proposed an 
algorithm to detect dictionary attack. The algorithm splits the 
attacks into 2 or more phases. The criteria that are used in the 
algorithm are packet per flow criterion and minimum number 
of flows which are calculated in 1 minute time intervals. Based 
on this algorithm, threshold points are considered in each phase 
of the attack. They assumed that the dictionary attack has 3 
phases: scan, brute-force and die-off phase. They detected the 
attack in high accuracy mode by applying threshold points to 
those three phases. 

Also, Graph-based intrusion detection systems are the type 
of security approaches using the properties of the network, not 
the content of packets. These systems detect the intrusion by 
analyzing the network graphs that can detect the high 
scalability attacks such as the worms. In this study, Zhou et al. 
[9] proposed an approach to use the graph concept for 
implementing the multi variable time series and their relation in 
each time, Iliofotou et al. [10] in 2007 proposed an approach to 
monitor and analyze the network traffic using traffic dispersion 
graph (TDG). They defined the traffic dispersion graph as the 
graphic presentation of interactions among the groups of nodes. 
The advantage of using the traffic dispersion graph is its power 
to presenting the structural relations of the attacks. Another 
approach is proposed by Le et al. [11] in 2011 based on the 
graph theory fundamentals such as degree of nodes, maximum 
degree of graph and similarity distance of graph. 
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One of the important fields in the graph-based intrusion 
detection is graph clustering method. In this field, Muniyandi et 
al. [12] used decision tree to optimize k-means algorithm for 
intrusion detection systems. Mingqiang et al. [13] in 2012 and 
Yin et al. [14] in 2014 also used the graph clustering to detect 
the attacks. Mingqiang et al. assumed three sets for normal, 
pending and anomaly clusters and placed the data in N clusters 
and the clusters are placed in 3 types of defined clusters based 
on the rate of the normal and anomaly data that are already 
specified. The existing clusters in the set of pending clusters 
are placed in one of the normal or anomaly groups using local 
deviation coefficient. Yin et al. has optimized this approach 
and used local deviation factor instead of the local deviation 
coefficient. 

These approaches consider the dependencies between 
nodes, but they don’t consider the cluster-based behavior of 
attacks in the network traffic graph and don’t define an 
appropriate criterion to detect the attacks. This paper proposed 
a new criterion based on the graph clustering and considered 
the cluster-like behavior of attacks. The best results of 
detecting attacks are concluded by applying the appropriate 
time intervals and threshold points to the proposed cluster 
criterion in time series. 

II. Proposed approach 

This paper proposes an approach to increase the detection 
rate, decrease the false alarms rate and eliminate the restrictions 
in detection of all types of attacks. Hence, an anomaly-based 
intrusion detection approach along with the flow and graph 
concepts are utilized. To prepare the data for the proposed 
method, it is necessary to perform some preprocessing on the 
initial traffic which is described in the next subsections. Then, 
our proposed approach is explained in details. Finally, the 
approach and its strength and weak points are discussed. 
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A genetic-based graph clustering algorithm [16] is used in 
the suggested approach in which the flows are considered as 
the inputs (graph edges) to the algorithm. The following steps 
are included in this step: 


(1) In first step, an initial population is created. The 
encoding phase of the algorithm is based on random numbers 
generation such that the length of a chromosome is equal to the 
number of graph nodes that are encoded by the random 
numbers from 1 to the number of nodes which indicate the 
cluster’s number. 

(2) In second step, the fitness function is computed that it 
consists of 2 parts: 

(2. a) Internal communication in the clusters: The internal 
communications in each cluster is obtained as follows: 


A “N? 


( 1 ) 


In the equation 1, pi and Ni represent the normalized weight 
of edges and the number of nodes in i-th cluster, respectively. 
So Ni2 represents the total number of edges with maximum 
weight in i-th cluster. 

(2.b) External communication in the cluster: The external 
communications between the clusters are calculated using the 
equation 2: 

f0 ifi = j 

Eu = \'U (2) 


= f f « 


ifi*j 


The equation 2 represents the number of communications 
between i-th and j-th cluster. In this equation, sij represents the 
normalized weight of edges between i-th and j-th cluster and 
2NiNj expresses the maximum weight of edges between 2 
clusters. 

Therefore, the fitness function is obtained from equation 3: 

^ v , >1 

MQ=\ i S&T Vfe>1 (3) 


A. Preliminaries 

In the preliminary stage, the flows are extracted from the 
packet-based traffic. Then, these flows are clustered using the 
graph clustering algorithm in several time series that is based 
on genetic algorithm. These steps are described in sections 
2.1.1 and 2.1.2. 

A.l. Flow simulation 

In this step, the packets with common properties are placed 
in the same flow and also the number of packets for each of the 
created flow is calculated. Now, we have some flows with 1 
second time intervals such that we can use them in different 
time series. A 7-tuple is used to indicate the simulated flow 
[15]: 


U i fc = i 

(3) Selection phase of genetic algorithm is performed using 
the selection function and roulette wheel. 

(4) 2-point crossover and swap mutation are performed. 

(5) New population and new generation are produced using 
the replacement operation. 

After graph clustering, some properties such as sending time, 
number of clusters and nodes, number of internal and external 
packets, number of internal and external flows are calculated in 
the time series to create a new criterion. Thus, the cluster-based 
data set [17, 18] with mentioned properties are obtained. As an 
example, some time series of created data is presented in Fig. 1 
that is related to DARPA data set and the network traffic in 4th 
week, Tuesday [18]. 


(IP-Src, IP-Dst, Pcks, Time, Port-Src, Port-Dst, Prot) 

The items of the 7-tuple represent source and destination IP 
addresses, number of packets related to the flow, the sending 
time of the flow in the network, source and destination port 
numbers and protocol type, respectively. 

A. 2. Graph clustering of the flows 
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Figure 1. An example of cluster-based data 
B. Proposed criterion 

The appropriate criteria will be described using the 
explained concepts and the properties that are related to the 
graph clustering data set. Thus, the best criterion is needed to 
achieve a system with high accuracy, high true alarms rate and 
low false alarms rate. 

In more specific, the number of internal flows is defined to 
represent the number of connections among the IPs (nodes). To 
achieve an appropriate internal criterion in the time series, and 
according to the behavior of some attacks such as IP scan, it is 
expected that the average weight of internal flows represents an 
increasing value. We define this criterion in equation 4. 

lTf=o IntPckti 

Cl “ S? =0 /nt/JoWi (4 ) 

In this equation, IntPckti and Intflowi represent the number 
of internal packets and internal flows in i-th time bin, 
respectively. 

In addition, the cluster with minimum number of external 
communications shows the nodes that are connected to less 
number of other clusters. In this situation, the nodes of 
mentioned cluster will be considered as an attacker and the 
other nodes that are connected to these nodes will be assumed 
as victims. To achieve this criterion, the following step is 
performed in the time series: the number of external flows of 
each cluster is calculated and the cluster which has the 
minimum number of external weight is selected. We represent 
this as Ei in each time interval (equation 5). In this equation, 
ExtPckti, Extflowi and Nodes represent the number of external 
packets, external flows and nodes in i-th time interval, 
respectively. 

ExtPckti 

1 Xjkg Ext f low i Nodes ^ 

By analyzing the behavior of some other attacks such as DoS 
[19], it is expected that in these types of attacks, there exist 
external flows among the most of clusters and the sum of 
external weights of other clusters are more than the most value. 

The final criterion is achieved by dividing the Ci into Ei. 
According to the mentioned cases, if the value of this criterion 
is rather than a threshold point, an anomaly has been occurred. 
This criterion is calculated based on the equation 6: 


III. THRESHOLD POINTS AND TIME INTERVALS 

In this paper, a static way is used to identify the best 
threshold point based on comparison between the normal and 
anomaly points. Therefore, 4th week data of DARPA is 
analyzed for some time intervals and threshold points. 

To access the highest detection rate and also, minimum false 
alarms rate, the data is tested in several time intervals and 
threshold points. Four time bins are considered: 45, 60, 75 and 
90 seconds. For these 4 time intervals, a threshold point is 
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identified for the introduced criterion. Also, the evaluation 
rates of intrusion detection methods are calculated. 

In 45 seconds time interval, at first, the traffic is split to the 
equal time intervals (45 seconds) and the time series are created 
in 45 seconds time bins (ti, t 2 , t n ) such that U indicates the 
first 45 seconds, t2 the second 45 seconds and t n the n-th 45 
seconds. Then, each of these intervals is tested in the final 
model. 

Tables 1-4 represent the results of the time windows such 
that four appropriate threshold points are obtained for intended 
criterion and the evaluation rates are identified due to it. The 
observations represent that the threshold points at 55, 50, 75 
and 75 have the maximum detection rate in the time intervals, 
respectively. 

After applying the final model on four different time 
windows, the evaluation rates are obtained according to the 
appropriate threshold points and the best threshold points are 
selected for each of time windows. Then, to investigate that 
which of the time bins have the maximum detection rate and 
minimum false alarm rate, the results of selected threshold 
points are compared. As we see in table 5, 75 seconds time 
interval has the best results in comparison with the other ones. 
Also, other time windows have high detection rates, however, 
75 seconds has the highest accuracy relative to others. 
Therefore, if the results checked based on the detection rate, it 
is clear that the detection rate decreases with the increase in the 
time window, but if the results checked based on the false 
alarm rate and the detection rate, it is not concluded that the 
smaller time window always obtains the best result. 


TABLE I. Evaluation rate in 45 seconds time window 


Threshold 

False 

True 

False 

True 

Detectio 

point 

positive 

positive 

negative 

negative 

nrate 

>35 

0.154 

0.902 

0.098 

0.846 

0.874 

>45 

0.107 

0.873 

0.127 

0.893 

0.883 







>65 

0.077 

0.881 

0.119 

0.923 

0.902 


TABLE II. Evaluation rate in 60 seconds time window 


Threshold 

False 

True 

False 

True 

Detectio 

point 

positive 

positive 

negative 

negative 

n rate 

>30 

0.086 

0.896 

0.104 

0.914 

0.905 







>70 

0.057 

0.870 

0.130 

0.943 

0.914 

>90 

0.081 

0.923 

0.077 

0.919 

0.921 


TABLE III. Evaluation rate in 75 seconds time window 


Threshold 

False 

True 

False 

True 

Detectio 

point 

positive 

positive 

negative 

negative 

nrate 

>45 

0.078 

0.944 

0.056 

0.922 

0.932 

>60 

0.047 

0.935 

0.065 

0.953 

0.944 







>90 

0.056 

0.968 

0.032 

0.944 

0.956 


TABLE IV. Evaluation rate in 90 seconds time window 


Threshold 

False 

True 

False 

True 

Detectio 

point 

positive 

positive 

negative 

negative 

n rate 

>45 

0.070 

0.936 

0.064 

0.930 

0.933 

>60 

0.098 

0.928 

0.062 

0.902 

0.915 

>75 

0.043 

0.933 

0.067 

0.957 

0.946 
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0-941 | decreasing in the true positive rate in our approach towards the 

Manandhar’s approach. 


| >90 | 0.070 | 0.952 | 0.048 | 0.930 


TABLE V. Comparison of 3 time windows 


Time 

False 

True 

False 

True 

Detection 

interval 

positive 

positive 

negative 

negative 

rate 

45 

0.053 

0.921 

0.079 

0.947 

0.934 

60 

0.066 

0.918 

0.082 

0.934 

0.926 







90 

0.043 

0.933 

0.067 

0.957 

0.946 


According to the explained cases, we select 75 seconds time 
interval and use it to evaluate and compare the proposed 
approach to other methods. 

In table 6, the detection rates for each attack are shown; the 
maximum detection rates are related to the scan and the DoS 
attacks. 

The results of the best case for the proposed approach in 
comparison of the packet-based methods are shown in table 7 
based on the appropriate time window and threshold point. 
The results of the table show the performance of using the 
flow concept in our procedure such that in the detection rate of 
packet-based method is less than the flow-based one. 

In this part, the results of the methods before and after the 
clustering are compared. Hence, the detection rate, true 
positive and false positive rate are calculated before and after 
the clustering using the time window and appropriate 
threshold point. The best threshold point is 75 before the 
clustering and is 60 after the clustering. The results of this 
comparison are shown in table 8. 

The results show the performance of clustering approaches. 
According to the fixed false detection rate, the cluster-based 
detection rate has 25% increasing value that is significant and 
the detection rate increases 15% that is related to using the 
clustering approach in our methods. 

In this section, the results of proposed approach are 
compared with the other new methods on DARPA data set 
[18]. Also, false positive rate (false alarm), true positive 
(detection), detection rate are used for capability in detecting 
the types of attacks. The rate of capability to detect the types 
of attacks represents that the proposed approach is offered to 
detect which one of the attacks. 

Table 9 represents the comparison of the final results of our 
proposed method to the Manandhar’s approach which is used 
the packet header analyzing and just detect the TCP-based 
attacks. In the Manandhar’s approach, 2 results are obtained 
that one of them is based on the low false alarm rate and the 
other one is based on the high true detection rate. 

The results of table 8 show the performance of our approach 
in the detection rate and the false positive rate. The detection 
accuracy of our proposed approach has 20% increasing for the 
results with the high detection rate and 5% increasing for the 
results with the low false alarms. Also, in the proposed 
approach, the false positive rate has been decreased 5% to 
10% in the comparison to the other methods and the other 
approach has been increased 1% to 5% in the positive phase. 
The other approach can detect 85% of attacks, while our 
approach can detect 95% of them and it can be the reason of 


TABLE VI. Detection rate of types of attacks in 75 seconds time 

WINDOW AND THRESHOLD 75 


Attack 

type 

DoS 

Scan 

Local 

access 

User to 
root 

Data 

Total 

Detection 

rate 

0.973 

0.981 

0.922 

0.911 

0.904 

0.933 


TABLE VII. Flow -BASED VERSUS PACKET -BASED APPROACH 


Method 

False 

positive 

True 

positive 

False 

negative 

True 

negative 

Detection 

rate 

Flow-based 

0.091 

0.939 

0.061 

0.909 

0.924 

Packet-based 

0.199 

0.837 

0.163 

0.801 

0.819 


TABLE VIII. Before clustering versus after clustering method 


Method 

False 

positive 

True 

positive 

False 

negative 

True 

negative 

Detection 

rate 

After 

clustering 

0.056 

0.932 

0.068 

0.944 

0.938 

Before 

clustering 

0.288 

0.856 

0.144 

0.712 

0.784 


TABLE IX. Comparison of the proposed approach with the 
Manandhar's approach on DARPA 


Method 

False 

positive 

True 

positive 

Detection 

accuracy 

Rate of 
detection 
capability 

Our 

proposed 

approach 

0.014 

0.985 

0.990 

0.948 

Manandhar 
(low false 
alarm) 

0.081 

0.946 

0.936 

0.932 

Manadhar 

(high 

detection 

rate) 

0.066 

0.955 

0.928 

0.925 


IV. Conclusion 

In this paper, a new cluster-based data set was provided. It 
was built over the DARPA data set. We also proposed a new 
flow-based approach to detect anomalies. It uses a new 
criterion which is computed from internal and external weights 
of clusters of the network graph. The comparison results show 
that high detection accuracy is achieved through the proposed 
approach. As future work, we aim to consider new cluster- 
based criteria to achieve higher detection rates. 
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A Comparative Study of Smoothing a Vehicle’s 
Trajectory which is calculated by an Evolutionary 

Algorithm 

Bayram Ali BURAN 1 , Suleyman Hikmet CAGLAR 2 and Ozgur Koray SAHINGOZ 3 


Abstract — Determining a vehicle’s trajectory is a complex and 
hard to solve type problem in the literature and it is identified 
as a NP-Hard optimization problem which is studied in different 
engineering disciplines such as computer, electrical and industrial 
engineering. It has been observed that such complex problems 
can be solved by using various approaches and lots of them 
are focused on the usage of Evolutionary Algorithms especially 
in case of a large number of controls points which are needed 
to be visited. Although these algorithms provide near optimal 
solutions, in the real world, vehicles are not able to follow 
this determined path (trajectory) without any deviation. Because 
vehicles are moving objects and each one moves with a certain 
speed. Therefore it is impossible for a vehicle to make a sharp 
turn after visiting control points. These vehicles need to make 
smoothed turns over these points. Therefore there will be a 
certain difference between the calculated path and the real path. 
It is needed to determine the real path by using necessary 
mathematical solutions for smoothing these paths. To ensure 
the motion continuity of vehicles, they need to follow paths 
determined according to a certain criterion. In this study, the 
most common smoothing methods which are used to ensure these 
continuities (Bezier, B- Spline and Dubins) have been compared 
and it is aimed to show the different approaches in an application 
area of path planning problems as a comparative study. 

Keywords — Unmanned Aerial Vehicle , Path Planning Evolution- 
ary Algorithm , Bezier Curves; B-Spline Curves , Dubins Path. 

I. Introduction 

Trajectory Planning Problem can be defined as a com- 
putation of a trajectory of a single vehicle (or a group of 
vehicles) to reach the desired goal state by avoiding obstacles 
and collisions. Trajectory planning can also be described with 
different meanings in the literature such as path planning, 
motion planning, trajectory optimization, etc. The vehicle in 
trajectory planning can be used either as a car/robot or in a 
more complex way it can be used as an Unmanned Aerial 
Vehicle (UAV). 

Usage of various vehicle types results in different challenges 
in the optimization of trajectory due to their specific charac- 
teristics. For instance, a robot, a car or a quad-copter have 

1 B.A. Buran is with the Department of Basic Sciences, Turkish Air Force 
Academy, Istanbul, Turkey, 34149 e-mail: bburan@hho.edu.tr. 

2 S.H. Caglar is with the Department of Mathematics and Com- 
puter Science , Istanbul Kultur University, Istanbul, Turkey, 34149 e-mail: 
s.caglar@hho.edu.tr. 

3 O.K. Sahingoz is with the Department of Computer Engineering, Turkish 
Air Force Academy, Istanbul, Turkey, 34149 e-mail: sahingoz@hho.edu.tr. 
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the ability to return sharply from the corners and to stop and 
go backwards, however, a fixed-wing UAV has to fly with a 
minimum velocity not to fall to the ground. 

In this study, it is aimed to implement a trajectory planning 
methodology for various autonomous vehicle types. However, 
it is especially focused on applications and scenarios for UAVs 
as depicted in Figure 1. The trajectory can be constructed 
with different evolutionary approaches [1], [2] which will be 
detailed in the next section. However, these produced routes 
have sharp returns, and this cannot be done with most of 
the vehicle types such as UAVs or fast cars. Therefore, for 
a realistic planning, there is a need to determine a smoothed 
path according to physical constraints of the vehicle. In this 
study, it is aimed to construct a smoothed path by using mostly 
used smoothing methods such as Bezier, B-Spline, and Dubins. 
We try to make a comparative study of these methods and lead 
the researchers on this topic. 


Fig. 1. 3d Trajectory Planning for multiple UAVs 

In the literature mainly two types of trajectory plannings 
are studied: as two dimensional and three dimensional. In tra- 
jectory planning, environmental conditions (mountains, build- 
ings, trees, etc.) and properties of used vehicle are also our 
constraints which we might be encountered, and this increases 
the complexity of the problem also. 

A path planning on which vehicles could pass over the 
pre-determined points on the basis of performing the tasks 
autonomously is required. The number of control points can 
change according to the planned task, and that number some- 
times finds thousands. With an increasing the number of 
control points, the priority of visiting points is turning into 
an optimization problem in terms of time and total distance. 
In the literature review, we found that Evolutionary Algorithm 
is widely used and accepted method in many research studies 
to solve this problem [3], [4]. 
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After the determination of priority of the visiting the control 
points, by combining the points there occurs a path that is out 
of mobility of the vehicle. By smoothing that path, the path 
seems more realistic as in the real world. The smoothing effect 
of the vehicle’s path is seen in the Figure 2. In our literature 
review about this issue, we found that smoothing methods were 
acquired by Bezier Curves, B- Spline Curves, and Dubins Path 
[5]— [1 1] . 



Fig. 2. Vehicle Paths 

The outline of the paper is organized as follows. Section 
2 first presents some background knowledge about trajectory 
planning such as evolutionary algorithms and additional artifi- 
cial intelligence search techniques. The smoothed algorithms 
of the path are described in Section 3 by giving the details of 
the Bezier Curve, B-spline Curves, and Dubins curves. Section 
4 concludes the paper by presenting some discussions and 
future works. 

II. Trajectory Planning Algorithms 

Even though in Travelling Salesman Problem (TSP) type 
optimizations the primary part of path planning a kind of 
optimization problem it is a type of NP-Hard (Non-Polynomial 
Hard) problem which cannot be solved by traditional mathe- 
matical methods. In this type problems, there can be many 
number of the control point, which each vehicle need to visit 
them. However, if the number of control points increases, it 
cannot be solved with standard mathematical models. There- 
fore, in the literature, some additional approaches are used. 

Some researchers tried to use Voronoi diagrams to determine 
the trajectory of the vehicle by getting into account the distance 
of each point. Mainly in this type problems, points are defined 
as the center of the threat area. Therefore the vehicle tries to 
go as far as from these points. However, this type of solution 
is not acceptable for controlling some specific point in the 
mission theatre [12]. 

Therefore, evolutionary algorithms and swarm based opti- 
mization algorithms are used for solving the problem. Genetic 
algorithms are used especially for simply producing some 
solution and try to improve their quality with creating new 
generations [13]. With the usage of a specific operator such as 
crossover and mutation, the solution can be improved in each 
iteration. However, this is not sufficient for reaching the global 
best solution. This algorithm produces a near-optimal solution 
in its domain. 

Ant Colony optimization algorithm is preferred in some 
research, and in this approach inspired by the foraging behavior 
of real ant colonies. In this approach, each ant leaves a deposit 


pheromone on the ground (as an indirect communication) 
to mark the route for him and also for other ants of the 
colony [14]. This approach, and also the genetic algorithm, 
can be easily implemented in a parallel structure. Therefore, 
can be executed quickly. Also, some additional approaches like 
RRT and Bee Colony Optimization are preferred for solving 
the problem. These methods are not mathematical methods 
but heuristic methods. These methods are similar to iteration 
method which is used in mathematics. 

III. Smoothing Strategies 

Construction of a path can be solved by using some different 
type of algorithms. However in the real world, these predeter- 
mined paths cannot be followed by the vehicles due to their 
physical constraints such as velocity. Therefore, there is a need 
to smooth these calculated paths in a more feasible way. Robots 
and quad-copter type vehicles can follow their determined path 
because they can make sharp turns in their motion. However, 
in trajectory planning of UAVs or cars, velocity factor must 
be taken into consideration with additional physical factors 
such as turning angle, etc. In mathematics, there exist some 
smoothing strategies as Bezier, B-Spline, and Dubins. In the 
following section, these approaches are detailed. 

A. Bezier Curves 

Bezier Curves were developed by Pierre Bezier who is an 
engineer in order to design vehicle bodies. Using a specific 
number of points the curve segment can be approximated. 
These curves dont pass over the points except first point and 
last point. A Bezier Curve in N-th order is generated by N+l 
control point. Bezier Curves are generated by the construction 
called convex combinations because they are contained in the 
convex hull composed of control points [5]. 

The Bezier Curve is mathematically formulated based on 
Bernstein Polynomials. According to this basis function of n- 
th order is shown in Equation 1 by using i parameter for control 
points [6]. 

C 1 “ t T~ l ( 1 ) 

Curve, Pi i-th control point and basis function related to 
B(i,n ), is algebraic formulated in Equation 2. 

n 

C(t) = J2PiBi,n(t) (2) 

i = 0 

By using an adequate-order Bezier Curve, various figures 
can be produced. However, while the degree of the algorithm 
increases, the complexity also increases and the calculation 
process will take longer than expected. Because of the increas- 
ing complexity high degree curves are affected more due to 
the rounding errors. As a result of this, the relation between 
control points and curve decreases. 

Because of these adverse effects, it is common to form 
complex pathing figures by multiple Bezier Curves. Especially, 
hybrid use of second and third order Bezier Curves, according 
to the number of the control points, is more common. 
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Formulations of the Bezier Curves are presented in the 
ongoing part; Linear Bezier Curve is shown in Equation-3, 
Quadratic Bezier Curve is shown in Equation-4 and Cubic 
Bezier Curve shown in Equation- 5. 

^ 1,1 (t) = (1 — t) Po + tP\ (3) 

-^ 2,2 (t) = (1 — t) Po + 2 1 (1 — t) P% H- 1 2 P 2 (4) 


£ 3,3 W « (1 - tf P 0 + 3t (1 - tf Pi + 3 t 2 (1 -t)P 2 + t 3 P 3 (5) 

In the equation of Cubic Bezier Curve coefficients of P^i = 
0, 1, 2, 3 are terms of binomial expansion of the ((1 — t) + 1) 3 . 
This property is applicable for all Bezier Curves given by N+l 
control points. N+l control points Pi, i m 0, 1, N, coefficients 
of these points are terms obtained from binomial expansion of 
((1 — t)+t) N . This can be easily seen when analyzed Equation 
2 . 

In Figure 3 , there are the figures of a) Quadratic Bezier 
Curve and b) Cubic Bezier Curve samples in t G [0,1] interval 
and given with the tangent at t = 0, 5. See that given with 
curves lie within the convex hull generated by given control 
points. 


Pi 



a) Quadratic Bezier Sample 



When analyzed Figure 4 generated by combination of two 
figures in Figure 3 , Figure 4 doesnt comply with maneuver- 
ability constraint the vehicles. For the continuous curve, when 
we combine two curves , ending point of the first curve and 
starting point of the second curve must be same. 


Pa 



Fig. 4. Combination of two Bezier Curve 

But continuity is not sufficient for an effective path planning. 
Bezier Curve that is planned to have same tangent on the 
combination points, and they have C 1 continuity and more 
smooth combination. Because of the low probability of the 
getting curves with C 1 continuity spontaneously by inserting 
imaginary points the Bezier Curves with C 1 continuity are 
obtained as is seen in Figure 5. 



Fig. 5. Smooth Combination of two Bezier Curve 

According to problem type, there can be a need for the 
addition and insertion of some points to the curves. When 
analyzing the Quadratic and Cubic Bezier Curves in Figure 4, 
the tangents which are at the end points are created by using 
control points which are the ending point of the previous curve. 
C 1 continuity (in other words continuity of the first derivatives) 
can be accomplished if and only if the tangents at the point 
of junction must be same straight line. This condition can be 
made by inserting an imaginary point after the starting point 
of the second curve. 

The control points forming the first curve are: 

Pi,i = 0, 1 .i 

The control points forming the second curve are: 

Pj \j = i, i + 1, .k 

Let show the imaginary point that we want to insert the 
second curve with P^. Meanwhile because of inserting a new 
point, the order of the second curve will increase. 

Pi-i, Pi and P^ will be on the same straight line 
and as we know the priority of points, distance between 
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Pi-i and Pi (xi,yi) and between P* (xi,yi) and 

Pi * (a;**, I/**) can be formulated by a k parameter. We get the 
Equation 6. 

Pi - Pi_! = k (P« - P4 and P» = (6) 

By using the Equation 6. for each curve segment, a imagi- 
nary point is inserted just after the merging point with previous 
curve. Due to the new control points, it will be the same 
tangent in point of junction so C 1 continuity is provided. We 
are going to work on minimizing the effects of imaginary point 
on curve length and trajectory by choosing k parameter positive 
but very small. The effect of inserting imaginary points is 
shown with Berlin 52 TSP sample in Figure 6. Mainly this 
shown for the effect of smoothing operation. In the first figure 
the best path is calculated by using an evolutionary algorithm, 
and then it is smoothed by using a Quadratic Bezier Curve. 
However, as can be seen from the Figure 6. a this path cannot 
be used by a vehicles such as UAVs. Therefore the planned 
path must be in a continuous format. As can be seen in Figure 
6.b this path can be converted to a continuous curve by using 
some imaginary points. 



B. B-Spline Curves 

The first important studies on B- Splines basis functions 
were carried out about 70 years ago by Schoenberg after this 
fundamental algorithm was developed by Cox and de Boor. 
Many pioneer researchers in the field, Riesenfeld, Boehm, 
Schumder and more, proved that B-Splines were in the scope 
of CAGD-Computer Aided Geometric Design were available 
and attractive presentation method and they were feasible [7]- 

[9]. 

Also B-Spline Curves, like Bezier Curves, do not pass over 
all of the control points. They can be used in various orders. 
When Bezier Curves were generated by control points, like 
data points, it is necessary to make sure that derivatives are 
continuous. Also, if the place at a control point changes the 
curves will be affected in Bezier Curves by this change. B- 
Spline functions have basis functions which have partial effect 
in order to these disadvantages of Bezier Curves. These basis 
functions are zero out of their domain. Hence, the curve takes 
form according to a few control points that are close to it. The 
order of B-Spline Curves and number of the control points are 
unrelated [10]. 

Defined by k-order and n+1 control points P 0 , Pi, ...., P n 
B-Spline Curve C(u) f s mathematical formula is shown in 
Equation 7. 

n 

C(u) = J2 N MPi ( 7 ) 

i = 0 

Here u is defined at 0 < u < n — fc + 2. n-k+2 explains how 
many segments the curve has, as well. 

A B-Spline Curve with k-order generated by k-1 degree 
polynomial segments with C k ~ 2 continuity at break points. 

Through, to < t\ < t n+ k, a set of non-decreasing break 

points a knot vector determining parametrization of basis 
functions is defined. See the Equation 8 and 9. 

T= (t 0 ,t 1 ,....t n+ k) (8) 

0 < i < n + fc and ti knots are defined as in Equation 9; 


a) For Berlin 52 Sample Quadratic Bezier Curve 



b) For Berlin 52 Sample C 1 Continuous Quadratic Bezier 
Curve 

Fig. 6. The Effect of Insert Imaginary Points in Quadratic Bezier Sample 


( 0 if i < k 

U = <i — fc + 1 if k < i < n (9) 

(n — fc + 2 if i > n 

According to a given knot vector T, for fc = 1, Ni^(u) 
is combined B-Spline basis functions are defined as in the 
Equation 10. 

iv„ (»)={; ift 'f f " no f- +i uo) 

For k > 1 and z = 0,1,2, n is defined as in the Equation 

11 . 


Ni, k (u) 


-Nik-i M + *,*+* 7 ,“ N 




(u) nn 


Lets accept = 0 for Equation 1 1 not to be undefined. 
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Control points generating B- Spline Curves are called as 
Boor points. Basis function N^ k (u) and its knot vector, T = 
(^o?^i — t k — 1 5 tk •> — ^n—iPnPn+i — t n _|_/ c ), were defined 
by n + k + 1 element in other words sum of number of control 
points, n+1 and order of curve k, the Equation 5 and 8 . Each 
knot interval ti < u < ti + 1 was grafted on a polynomial 
curve between two serial correlations C (U) and C (U+%). 
Normalizing the knot vector, to cover [0,1] interval, increases 
numerical sensitivity of at changing points because of high 
density of number in this interval. 

When generating C(u) B-Spline Curve for n+1 control 
points, and k = 3 by using given Boor Algorithm with 
Equation 6 and 7, we can deduct Equation 12. 


C. Dubins Path 

The shortest path between two points under curvature 
constraint in the plane is expressed as the Dubins Path. 
This method was developed by Lester Eli Dubins in 1957 
by using Analytic and Differential Geometry method. By 
using Pontryagin's maximum principle the researchers proved 
that this method meant the shortest path. This method is a 
special type of Euler-Lagrange equation. The shortest path 
in this method is generated by adding maximum curvature 
and straight lines to circular arcs. Using the Dubins Path in 
Nonholonomic vehicles, wheeled robots, air crafts, underwater 
vehicles is common. To calculate the length of parametric 
curves Equation 17 integral is used. [11] 


C (u) = ^ (i + 1 - uf Pi + ^ [(u - i + 1) (i + 1 - u) + 

(i + 2-u)(u — i)\Pi+ 1 + \(u- if P i+ 2 

( 12 ) 

1 < i < n — k + 2 and i < u < 1 are accepted. 

If we change of variable to u f = u — i in Equation 12, then 
the ”0 < u' < 1” becomes a constraint for Equation 13 is 
obtained. 

C(u') = | [(1 - u'f Pi + (-2m' 2 + 2 u’ + 1) P i+1 + M ,2 P i+2 ] (13) 

Now all curve segments are obtained from the change of u 
in [0, 1] interval. Again change of variable to u instead of u is 
done and when the Equation 13 is transform into matrix form, 
it can be deducted the Equation 14. 



There are minimum turning radius p, rotation 0 and initial 
point (x s , y s ) E R 2 by coordinate in plane, equation of tracing 
is shown in Equation 18. 

X = Vq cos 0 

< y = vo sin# (18) 

6=^U, u e [o, l] 

Where is constant velocity and u is normalization pa- 
rameter. [15]. The shortest path that corresponds to maximum 
curvature particularly for an air plane, can be either CLC or 
CCC (C:Circle, L:Line) paths or a subset of these. C refer to 
circular arc and L refers to straight line which is tangent to C. 
See Figure 6 . 


C{u) = - [u 2 


i -2 r 
-2 2 0 
1 1 0 


Pi 1 

Pi 

Pi+1. 


(14) 


For i E [l,n — 1], each curve segment is obtained by 
changing u variable in [0, 1] interval. Because of k=3 2-nd 
degree equation is obtained and dimensions of matrixes are 
seen to be related to number 3. Consequently, C{u) B-Spline 
Curve in relation with k parameter can be shown in Equation 
15. 



Fig. 7. CLC and CCC Types of Dubins Path 


C(u) = U k M k P k (15) 

So by choosing k=4 the Formula of Cubic B-Spline Curves 
can be represented as in Equation 16. 

-1 3 -3 

3 6 3 

-3 0 3 

1 4 1 

Although smoothness of the path created by B-spline Curves 
is one of the advantage of the algorithm. However the distance 
between the curve and the internal points is seen a main 
disadvantage. In this study, it is aimed to minimize the distance 
between the curve and internal points with some specific 
constraints. 


C (u) = [' u 3 u 2 u l] 


" 

[Pi-ll 


Pi 


Pi+1 


Pi+ 2 _ 


( 16 ) 


A Dubins Path requires the use of the following parameters 
as depicted in (Figure 7) 

• Initial Point : P s (x s ,y s , 0 S ) 

• Final Point : Pf(xf,yf,Qf) 

• Initial Curve : 

• Final Curve : 

Consequently, it will be enough to enter the coordinates of 
initial and final points of Dubins Path. 

Dubins Path finds fastest and shortest path, but the method 
also has disadvantages like the direction of the path, which 
is forward, and the increase of sub problems related the 
number of vehicles. The method mentioned here are open for 
improvement. In literature reviews, there is no research show in 
which method is the most advantages with constraints, though. 
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Fig. 8. Dubins Path Made in two Dimensions by Using Circular Arcs and 
Straight Line , Segments 

Consequently, it will be enough to enter the coordinates of 
initial and final points of Dublin’s Path. Dubins Path finds 
fastest and shortest path, but the method also has disadvantages 
like the direction of the path, which is forward, and the 
increase of subproblems related the number of vehicles. The 
method mentioned here are open for improvement. In literature 
reviews, there is no research show in which method is the most 
advantages with constraints, though. 

IV. Discussion and Conclusion 

In this paper, it is aimed to plan a path of a vehicle, such 
as a UAV, by using some evolutionary algorithms. However, 
after executing these algorithms, the produced path cannot be 
traceable by the vehicles. Therefore, it is needed to smooth 
the produced by using some mathematical approaches. In 
this paper four different methods were investigated and the 
comparison of them are presented in Table 1 to give a prior 
opinion for the researchers. 


TABLE I. Comparison of Studied Methods 


Methods 

C° 

Continuity 

C 1 

Continuity 

Degree 

Density 
of Points 
and Curve 

Function 

Type 

Bezier C. 

Yes 

Yes 

High 

Distant 

Polynomial 

Hybrid 
Bezier C. 

Yes 

No 

Low 

Close 

Polynomial 

B-Spline C. 

Yes 

Yes 

Low 

Close 

Polynomial 

Dubins Path 

Yes 

Yes 

Intermediate 

Close 

Trigonometric 


As can be seen from this table, researchers must be careful 
about their aims in selecting the smoothing algorithm. The 
depicted table shows the advantages and disadvantages of 
them. For realistic path planning, especially with autonomous 
vehicles with high speed, like Unmanned Aerial Vehicles 
(UAVs), these detailed curves with their mathematical back- 
ground must be used. In this paper, three types of curve models 
were examined, and their advantages and disadvantages were 
shown. Also, necessary mathematical formulas were presented 
as a source for researchers studying on this subject. 

For further studies, it is aimed to implement all these meth- 
ods in a real world scenario such as UAV path planning which 
has a relatively high speed than autonomous robot cars. Firstly, 
a path will be constructed by using an evolutionary algorithm, 
such as genetic algorithm, and then this produced curve will be 
smoothed by using all of these compared algorithms. Finally, 
the total real costs will be calculated and compared. 
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